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INTRODUCTION 


Unaer  contract  to  the  Army's  Night  Vision  and  Electro-Optics  Laboratory. 
Westinghouse  has  been  investigating  the  design,  test,  and  implementation  of  a  set 
of  algorithms  to  perform  intelligent  tracking  and  intelligent  target  homing  on  FLIR 
and  TV  imagery.  Research  has  been  initiated  for  the  development  of  an  intelligent 
target  tracking  and  homing  system  which  will  combine  target  cueing,  target  signa¬ 
ture  prediction,  and  target  tracking  techniques  for  near  zero  break  lock  perfor¬ 
mance  The  intelligent  tracker  will  monitor  the  entire  field  of  view,  detect  and 
classify  targets,  perform  multiple  target  ttracking,  and  predict  changes  in  target 
signature  prior  to  tne  target's  entry  into  an  obscuration.  The  intelligent  track¬ 
ing  and  himing  system  will  also  perform  target  prioritization  and  critical  aimpoint 
selection.  Through  the  use  of  VLSI/VHSIC  techniques  the  intelligent  tracker  (with 
inherent  target  cuer)  can  be  applied  to  the  ful ly-autonomous  munition. 

During  the  fifth  quarter,  several  meetings  and  a  number  of  phone  conversations 
took  place  between  Westinghouse  personnel  and  CRT.  Reischer  of  NV&EOL.  The  system 
concept  was  described.  The  problem  of  target  aspect  Determination  in  support  of 
aimpoint  selection  was  analyzed.  Sets  of  875  line  FLIR  imagery  were  extracted  from 
the  NV&EOL  data  base  and  an  example  of  aspect  determination  for  a  maneuvering 
target  in  the  presence  of  obscurations  was  presented.  An  example  was  also  examined 
for  close-in  homing  (less  than  500  meters)  and  the  emergence  of  interior  features, 
target  movement,  and  scale  changes.  Hardware  implementation  in  terms  of  VLSI/VHSIC 
chips  .vas  also  analyzed. 

Westinghouse  personnel  purt icipating  in  this  effort  include  Thomas  Willett, 


Program  Manager,  Joseph  Abruzzo,  Vincent  Zagardo,  John  Shipley,  Leo  Kossa,  and 
Riciiara  Kroupa.  Program  review  and  consultation  is  provided  by  Drs.  Glenn  Tisdale 
and  Azriel  Rosenfeld. 


1.0  SYSTEM  CONCEPT 
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Tnfc  purpose  of  tins  seclion  is  to  aescribe  the  intelligent  tracker  concept  that 
has  evolved  during  the  course  of  this  work.  This  Fifth  Quarterly  Report  marks  the 

end  of  the  analysis  ano  design  phase;  hence,  the  system  concept  outlined  below  will 

be  employed  during  the  test  phase.  For  the  test  phase,  to  be  conducted  during  the 
next  quarter,  several  video  tape  sequences  will  De  prepared  to  exercise  the  intel¬ 
ligent  tracker  functions  and  tne  system  concept.  The  oesired  intelligent  tracker 
functions  are: 

1.  acquisition  and  track  initiation  -  oetect,  locate,  classify  ana  prioritize 

targets  automat ical ly  ^nd  hand  off  to  the  internal  tracker  (the  intel¬ 

ligent  tracKer  concept  is  assumed  to  incluae  Doth  acquisition  ana 

f' 

tracking ) ; 

2.  handle  multiple  targets  -  track  a  number  of  targets  in  a  scene  simul¬ 
taneously; 

3  target  signature  prediction  predict  or  anticipate  target  obscuration  and 
how  the  target  signature  will  change  as  a  result  of  the  obscuration; 

4.  reacquisition  -  reacquire  a  target  at  the  earliest  opportunity  following 
track  breaklock,  incluoing  departure  from  the  field  of  view; 

5.  aimpoint  selection  -  determine  the  critical  aimpoint  of  a  target  (which 
may  be  an  interior  point  within  its  silhouette). 

A  video  timing  oiagram  for  the  intelligent  tracker  is  siiown  in  Figure  1-1. 
Figure  1-2  shows  how  an  image  would  appear  in  the  frame  memory.  A  block  diagram  of 
the  system  concept  is  shown  in  Figure  1-3. 
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Tne  heavy  vertical  lines  in  Figure  1-1  represent  tne  cued  frames  and  the 
lighter  vertical  lines  are  the  tracked  frames.  Frame  1  is  a  cueing  frame; 
frames  I,  3,  4,  and  5  are  tracking  frames,  and  then  the  cycle  repeats.  Frame  b  is 
a  cueing  frame,  and  frames  7,  8,  9,  and  10  are  tracking  frames.  Thus,  the  complex 
target  cueing  function  is  performed  at  one-fifth  the  rate  ot  the  simpler  tracking 
function. 

Figure  1-2  represents  tne  image  in  the  frame  store  wnich  is  divided  into  five 
horizontal  and  aadressaPle  strips.  This  is  done  so  that,  instead  of  waiting  for 
the  entire  rrame  to  be  cueu  uefure  handover  to  a  tracker,  targets  can  be  fianded 
over  more  rapidly.  The  reduction  in  handover  lag  increases  the  confidence  that 
Doth  cuer  end  tracker  are  working  on  tne  same  target,  ana  reduces  the  size  of  the 
track  window.  The  frame  store  RAM  is  organized  into  five  independently  addressable 
strips  to  aia  in  sending  tne  gray  levels  surrounaing  the  target  to  the  tracker  for 
a  reference  image  and  to  decrease  the  access  time.  It  should  be  pointed  out  that 
the  five  trac'-ed  rrames  per  cuea  frame  ana  five  strips  for  frame  storage  are 
initial  estimates  serving  as  a  strawman  concept.  The  point  of  this  discussion  is 
that  the  cuer  results,  labelled  CR  in  Figure  1-1.  can  oe  utilized  uetween  the  first 
and  second  frames  insteaa  of  waiting  for  the  fifth  and  sixth  frames  in  the  video 
stream. 

Referring  to  Figure  i-3,  we  aescriue  the  intelligent  tracker  functions  previ¬ 
ously  listed: 

1.1  TARGET  ACQUISITION  ANU  handover 

A  Horizontal  strip  of  a  fra-iie  is  snatcned  in  real  time  and  placed  in  a  frame 
memory.  The  cuer  processes  the  strip  and  detects  and  classifies  all  targets.  That 
part  of  the  fiame  store  holaing  each  target  and  its  surrounoing  gray  scale  window 
IS  sent  to  the  tracker  as  a  reference  image.  The  tracker  converts  this  to  a  binary 
reference  image  ana  now  tracks  this  target  from  the  video  stream  until  the  next 
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cuea  tramc.  The  ^.uer  tells  tne  tracker  where  the  target  is  within  the  frame  so  the 
tracker  can  tell  the  write  control  when  to  write  the  next  target  window  (now  from 
the  video  stream)  and  subse-  quent  windows  into  its  reference  image. 

1.2  MULTIPLE  TARGET  TRACKING 

Tnere  is  a  reference  image  RAM  reserved  fur  eacii  target.  Only  one  correlator 
may  be  required,  since  it  is  fast  enough  to  be  multiplexed  among  15  targets  per 
frame  (Second  Quarterly,  page  1-9).  The  tracker  is  a  bandpass  binary  correlation 
tracker  in  this  preliminary  design  and  the  oanopass  (band  of  gray  levels  represent¬ 
ing  the  target)  is  adjusted  at  the  cueing  rate  by  the  cuer.  Additionally,  the 
tracker  forms  a  smoothed  track  for  each  target  using  second  order  difference  equa¬ 
tions  ito  interface  with  tne  rate  loop  in  the  sensor  gimbal).  This  allows  reac¬ 
quisition  of  a  target  which  has  left  the  field  of  view  but  can  be  brought  back  into 
view  by  moving  the  sensor  along  tne  image  centered  target  track. 

At  this  point  one  can  picture  simultaneous  multitarget  cueing  ana  tracking. 

The  cued  targets  nave  been  classified  and  are  ordered  in  an  internal  table  in  terms 
of  priority.  The  priority  precedence  has  been  determined  before  the  mission  ana 
loaned  in  the  cuer  and  may  be  updated  as  conditions  change  during  tne  mission. 

1.3  TARGET  SIGNATURE  PREDICTION 

To  predict  obscurations,  a  histogram  of  tt.e  bacKgrouno  ahead  of  the  target  is 
analyzed.  The  track  window  errors,  which  are  used  to  form  the  smoothed  track  for 
reacquisition,  are  also  usea  to  determine  tne  pusition  of  the  histogram  window  in 
front  of  the  target.  From  the  nistogram,  we  can  compare  the  gray  levels  ahead  of 
the  target  with  those  of  tne  target.  If  the  same  gray  levels  are  present  in  both, 
a  clean  target  segmentation  is  unlikely.  Tne  track  point  position  is  adjusteo 
within  the  track  reference  window  so  tnat  the  tracxer  is  using  that  portion  of  the 
target  which  will  be  obscured  last  (e.g.  the  rear  of  a  target  passing  behind  some 
low  lying  shrubs).  Tne  background  histogram  is  also  analyzed  tor  a  polarity 
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reversal  between  tne  target  ana  the  background.  An  example  or  this  is  wnere  the 
target  is  moving  from  a  dark  background  into  one  lighter  than  itself  and  hence 
becomes  a  dark  target  against  a  light  background.  Under  this  condition,  the  new 
background  is  segmented  and  binary  change  detection  at  the  background  level  is  used 
along  -.itn  cirect  seg-  mentation  to  detect  tne  target.  Once  having  found  the 
emerging  edge  of  the  target,  the  track  point  in  the  reference  is  adjusted  to  the 
front  or  emerging  edge. 

1.4  REACQUISITION 

Tne  reacquisition  function  comes  into  play  when  track  breaklock  occurs.  It  is 
handled  by  a  combination  of  cueing,  histogram  analysis,  ano  change  detection.  A 
difficult  problem  is  the  reappearance  of  a  target  wtiich  is  partially  obscured. 
Segmentation  of  the  entire  shape  is  not  possible,  which  impedes  automatic  target 
recognition.  In  this  case,  scene  change  detection  provides  one  of  the  few  cues  for 
target  reemergence.  Performing  a  histogram  analysis  of  any  changed  areas  segmented 
adds  useful  information  because,  in  some  cases,  tlie  target  histogram  will  exhibit  a 
peakedness  not  found  in  a  background  object  such  as  a  woods  clearing.  That  is,  the 
histogram  will  resemble  a  Gaussian  rather  than  a  Uniform  Distribution.  Otherwise, 
the  clearing  may  be  mistaken  for  a  partially  occluded  target. 

1.5  AIMPOIinT  selection 

Aimpoint  selection  is  the  process  of  designating  a  vulnerable  point  on  the 

target,  aiming  tne  sensor  at  that  point,  and  homing  on  it  in  the  case  of  a 

missile.  Target  c 1  ass i f icat ion  and  target  asp'-ct  are  inputs  to  the  aimpoint 
selection  process.  Target  classification  is  available  to  the  intelligent  tracker 
as  tne  synergistic  result  of  incluoing  a  cuer  in  the  concept.  Target  aspect  can  be 
determined  from  target  motion,  from  the  orientation  of  the  target  edges  in  the 
image  plane,  and  fr’om  tne  orientation  of  tiie  principal  axes.  The  direction  of 

motion  is  used  to  determine  the  front  of  the  target  and  to  identify  those  target 
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eages  associated  with  the  sides  of  the  target,  i.e.  the  orientation  angles  of  those 
edges  should  be  similar  to  the  direction  of  motion.  Analysis  of  the  edge  joining 
the  "side  edge"  tends  to  indicate  the  orientation  of  tne  end  of  the  target  in  the 
image  plane.  The  orientation  of  the  principal  oxes  tends  to  indicate  the  bulk  of 
tne  upper  target  mass  (the  turret  in  the  case  of  a  tank),  and  aids  in  target  aspect 
determination.  This  approach  was  applied  success-  fully  to  a  maneuvering  target  , 

Given  a  classified  target  with  aspect  determined,  the  aimpoint  is  selected  by 
comparing  the  target  to  one  stored  in  memory  with  the  approximate  classification 
and  aspect.  The  aimpoint  is  placed  on  the  target  through  computations  based  on  a 
similar  triangle  argument  with  respect  to  the  stored  image.  Reference  points  are 
the  centroid  and  the  four  intersections  of  the  principal  axes  with  the  target  sil¬ 
houette  for  both  the  real  and  stored  targets. 

For  a  missile,  homing  at  long  range  is  controlled  by  the  binary  correlation 
tracker.  For  close  in  homing,  24  lines  of  target  or  more,  homing  is  based  on  the 
successive  registration  of  connected  edges  associated  with  the  complete  target 
image,  exterior  and  interior.  The  hand-over  ii'om  long  range  noming  to  close  in 
noming  is  accomplished  by  simultaneous  correlation  ana  registration  of  the  same 
target  image.  Tracking  ana  homing  are  conducted  at  frame  rates. 
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2.0  DETERMINATION  OF  AIMPOINT  SELECTION  INPUTS 


The  dimpoint  selection  algorithms  tor  the  intelligent  tracker  were  described  in 
the  Fourth  Quarterly  Report.  The  required  inputs  to  aimpoint  selection  are  target 
classification  and  target  aspect.  Determination  of  these  inputs  is  the  subject  of 
this  section.  The  aimpoint  selection  algorithms  are  briefly  reviewed,  the  source 
of  the  target  classification  input  is  discussed,  and  target  aspect  determination  is 
analyzed. 

2.1  REVIEW  OF  AIMPOINT  SELECTION  ALGORITHMS 

Prior  to  the  mission,  the  aimpoint  is  determined  by  analyzing  the  target's  vulner¬ 
able  area  and  the  most  likely  mission  scenario(s).  Once  determined,  the  aimpoint 
information  is  stored  in  terms  of  a  feature  vector,  F,  for  each  target  class  and 
several  aspects.  The  feature  vector  is  constructed  by  forming  a  silhouette  of  the 
target  for  a  typical  aspect  angle.  The  silhouettes  can  be  divided  between  those 
typically  seen  by  a  low-flying,  roll-stabilized  weapon  and  those  top  views  seen  by 
a  ballistic  weapon  with  a  near  vertical  descent.  During  the  mission  a  target  is 
detected,  classified,  and  its  aspect  estimated.  The  aspect  data  is  useo  to  select 
a  feature  vector  from  memory  for  that  class  of  target  and  an  aimpoint  is  computed 
on  the  actual  target. 

Figure  2-1  shows  the  principal  pattern  axis  drawn  on  a  tank  and  the  selected 
aimpoint.  The  offsets  of  the  aimpoint  from  the  four  intersections  of  the  target 
with  the  principal  axis  (R1-R4),  as  well  as  the  perpendicular  offsets  from  the  pat¬ 
tern  centroid  (RO),  define  the  aimpoint.  These  offsets,  along  with  the  major  axes 
of  the  pattern,  define  the  feature  vector.  Computations  based  on  similar  triangle 
arguments  are  used  to  transfer  the  aimpoint  from  the  reference  target  to  the  actual 
target.  Note  that  the  centroid,  principal  axes,  and  principal  axes-target  inter¬ 
section  calculations  are  computed  on  the  binary  image  which  is  used  to  form  the  re¬ 
ference  image  for  the  binary  correlation  tracker. 
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For  a  partially  obscured  targt  with  one  or  more  of  the  edges  (hence  offsets) 
missing,  the  aiinpoint  computation  is  very  similar.  Assuming  a  constant  orientation 
angle,  the  centroid  correction,  and  accompanying  offsets  can  be  computed  by  esti¬ 
mating  target  dimensions  before  obscuf'ation.  On  subsequent  frames  after  the  target 
has  been  obsucrea,  size  estimates  are  generated  and  compared  to  the  initial  esti¬ 
mates.  The  observed,  but  erroneous,  centroid  measurement  is  then  corrected  on  the 
basis  of  comparing  the  true  and  observed  pattern  sizes. 
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2.2  DETERMINATION  OF  TARGET  CLASS 

A  cuer  is  an  integral  part  of  the  intelligent  tracking  and  homing  system,  as 
described  in  Section  1.0;  target  class  is  an  output  of  the  cuer.  The  availability 
of  target  class  is  another  example  of  the  synergism  achieved  by  combining  a  cuer 
and  tracker  into  one  system.  Misclassif icat ion  results  in  an  erroneously  placed 
aimpoint  because  the  wrong  target  model  is  employed.  For  generality,  it  is  assumed 
that  the  target  classification  algorithms  contained  in  the  cuer  do  not  provide 
target  aspect  determination,  which  will  be  considered  next. 

2.3  TARGET  ASPECT  DETERMINATION 

Tnere  are  a  number  of  interwoven  issues  involved  in  aimpoint  selection  and 
target  aspect  determination.  These  issues  are  discussed  separately  in  the  next 
several  subsections.  In  the  concluding  subsection,  their  interrelationship  is  con¬ 
sidered.  The  issues  discussed  in  relation  to  target  aspect  determination  are  the 
allowable  missile  "g"  forces,  missile  guid=>nce  settling  times,  detection  and  class¬ 
ification  ranges,  steering  capability  vs  range,  objectives  of  exterior  aimpoint 
selection,  missile  trajectory  and  a  partial  determination  of  target  aspect, 
complete  target  aspect  determination,  and  simplification  of  the  close-m  homing 
problem. 

2.3.1  Allowable  Missile  "g"  Forces 

In  tne  Fourth  Quarterly  Report,  a  missile  and  its  dynamic  characteristics  were 
assumed.  The  maximum  turning  rate  was  3  g's.  For  a  missile  traveling  at  500 
meters  per  second,  this  corresponds  to  a  maximum  turning  rate  of  3.4  oegrees/ 
second.  This  means  that  the  target  aspect  determination  throughout  the  missile 
flight  must  be  accurate  enough  to  avoid  requiring  a  corrective  turn  exceeding  3.4 
degrees/second. 


-2-3- 


From  the  Fourth  Quarterly  Report,  the  assumed  missile  settling  time  is  one 
second  from  impact.  At  a  speeo  of  bOO  meters  per  second,  the  aimpoint  and  unoer- 
lying  aspect  determination  solutions  are  essentially  completed  at  a  range  of  500 
meters  from  impact.  The  impact  CEP  was  assumed  to  be  one  meter.  For  the  staring 
sensor  with  a  field  of  view  of  0.24  radians  and  an  angular  resolution  of  0.15  mrad, 
as  assumed  previously,  approximately  30  lines  will  appear  on  the  target  at  500 
meters.  This  means  that  the  major  course  corrections  for  aimpoint  selection  must 
occur  on  a  target  smaller  than  30  lines  high.  A  possible  ramification  of  this  con¬ 
straint  is  that  the  exterior  aimpoint  selection  process  must  put  the  aimpoint  with¬ 
in  a  one  meter  CEP  of  the  desired  impact.  This  desired  aimpoint  may  be  confirmed 
and  slight  steering  adjustments  initiated  by  an  interior  feature  which  is  not  yet 
visible.  Note  that  the  30  line  target  subtenas  approximately  0.7°;  hence,  slight 
adjustments  (measurea  in  pixels)  may  still  be  possible  before  impact. 

2.3.3  Detection  and  Classification  Ranges 

Detection  and  Classification  ranges  were  founo  to  be  2700  and  1600  meters  re¬ 
spectively,  based  upon  work  in  the  previous  Quarterly  Report.  The  missile  could 
begin  turning  toward  the  detected  targets  at  2700  meters  with  the  final  target 
selection  reserved  for  the  classification  range  cf  1600  meters.  At  this  range  the 
possible  targets  are  limited  to  those  within  6.8  degrees  (2.0  seconds  x  3.4 
degrees/seconds)  of  the  missile  longitudinal  axis  since  the  missile  has  only  two 
seconds  flight  time  to  close  to  within  500  meters  of  impact  (the  guidance  settling 
range).  The  other  factor,  3.4  degree j/second ,  is  the  maximum  turning  rate. 

Further,  at  a  range  of  1600  meters,  target  aspect  determination  can  be  estimated 
with  knowledge  of  the  target  class.  At  this  point,  the  target  is  only  some  8-10 
lines  high,  so  there  is  a  question  of  how  accurately  this  initial  aimpoint  ana 
underlying  aspect  determination  needs  to  be  calculated.  This  issue  leads  to  the 
discussion  in  the  next  paragraph. 


c  .  :  h't t'T  ofiiidiicfc  ot  Lxter  ior'  Auiipuint  btuLlK.i 

At  c  iabs  u  ical  lull  rdiigeS  OT  iobU  iiiL-ltrs  oiiu  w.Ln  u  tL-ii-lirifc  taiytt,  tne  oini- 
po’iit  iniayes  shown  in  the  previous  Quarterly  Report,  Hyure  b.l-lb  (iiiiayes 
dtb,  ...  kbO)  snow  a  variaUiiity  in  airiipoint  location  ot  1.0  to  b.O  pixels  trorii 
Ti^anie  to  frame  tor  tne  side  view  or  a  tank.  At  this  range  1.0  pixel  con espoiios  to 
u.b  meters,  so  the  variability  circle  has  a  raoius  of  1.2  meters,  inis  conipares 
favorably  witn  an  assumed  aimpoint  OEP  of  1.0  meters.  The  var iabiiu>  snouio 
uiminisn  as  tne  range  eloSeS  ana  tne  vioeO  S  igna  1 -lu-no  i  se  n.,Lio  ot  trie  exteiioi 
e'.ges  increases.  Tnis  means  tnat  a  serious  attempt  can  ue  niaoe  ot  classification 
ranges  to  place  tne  auiipoint  correctly  on  the  target.  Tnat  is,  tne  aimpoint 
tracking  performance  is  gooo  eiiouyn  to  go  beyono  just  placing  tne  ainipoint  anywhere 
cn  tne  target  anu  upgraoiny  the  accuracy  as  the  range  to  impact  oiniinisnes  furtner. 
2.3.0  Nissile  Trajectory  anu  Partiol  Target  Aspect  Determination 

A  oichotcmy  of  aiiiipoint  reference  images  baseo  on  low-flyuig  anc  oallistic 
trajectories  was  menticneo  in  para.  2.1  ana  is  aiscusseo  here.  The  loec  is  to  use 
tne  missile  trajectory  aiio  target  classification  information  to  compute  the  amouht 
or  target  sice  ano  top  area  which  appear  in  the  target  project lOii  in  the  image 
plane.  At  Classification  ranges,  it  is  difficult  to  oistinguisli  between  the  side 
anu  top  portions  or  tne  target  linage.  In  Figure  2-2,  tnree  views  of  a  square 
target  are  seen  wmcn  correspono  to  eacn  of  three  trajectories.  Trajectory  1,  rep¬ 
resenting  the  low-flying  case,  sees  onlj  the  side  view  of  the  target.  Trajectory 
2,  represent i  ig  a  neof-vert leal ,  ballistic  termiria'  phase  sees  only  the  top  of  tne 
t..r  get . 
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Figure  2-2.  Three  Trajectories  and  Their  Respective  Images 


Trajectory  2,  representing  a  45  degree  terminal  descent  sees  doth  the  side  and  top 
of  the  target  in  its  image.  For  the  discussion  at  hand,  consider  a  low-flying 
missile  at  3km  ground  range  from  the  target  at  an  altitude  of  61.1  meters  (200 
feet)  with  a  field  of  view  of  0,24  radians,  as  shown  in  Figure  2-3.  Assume  the 
angular  resolution  of  tne  sensor  is  0.15  mrad  and  is  equivalent  to  0.5  minutes  per 
line.  The  angle,©  between  the  vertical  and  the  base.  A,  of  the  target  is©^ 

=  tan  ^  (3000/61.12)  =  88°  50'  .  The  angle,  between  tne  vertical  and 
point  B  of  the  target  is©^  "  tan’^  (3000/59.12)  =  88°  52'.  Note  that  this 
implies  two  minutes  of  arc  are  on  the  target  or  4-5  lines  which  is  in  agreement 
with  the  previous  work  in  the  Fourth  Quarterly  Report  which  found  4-5  lines  on  the 
target  at  3000  meters  for  the  assumed  sensor  .  The  angle  from  the  vertical  to 
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Figure  2-3.  Geometry  for  Low-Flying  Missile  Case 

point  C  is  (3002/69.12)  =  88^  62.04'  so  the  increase  in  angular 

coverage  from  point  B  to  point  C  is  less  than  0.04  minutes  and  much  less  than  an 
additional  line.  Hence,  for  the  low-flying  case  in  the  vicinity  of  the  detection 
range,  the  largest  aspect  is  entirely  a  side  view,  and  is  Trajectory  1  in  Figure 
2-3. 

Consider  the  low-flying  missile  closing  to  a  classification  range  of  1500 
meters,  and  an  altitude  of  61.1  meters  (200  feet).  The  previous  calculations  can 
be  repeated.  Here,  9  ,  ,  =  87°  40',  9^  =  87°  44',  and  ©3  =  87°  44.3'. 

In  this  case,  the  missile  sees  a  part  of  the  top.  but  the  maximum  amount  of  top 
snown  is  no  more  than  one  line  on  a  target  which  has  eight  lines  of  side  pro¬ 
jection.  These  calculations  are  significant  because  they  provide  a  prior  knowledge 
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of  the  target  aspect  in  one  of  the  three  directions.  For  example,  given  the  target 
class,  sensor  resolution,  and  a  low-flying  trajectory,  the  number  of  lines  on  the 
target  can  be  computed  as  a  function  of  range.  Further,  the  number  of  lines  ap¬ 
portioned  between  the  siae  anu  top  can  be  computed  as  a  function  of  range.  Flence, 
at  some  point  in  the  trajectory  the  measured  number  of  lines  on  a  target  classified 
by  the  cuer  can  be  divided  between  those  representing  Llie  side  and  those  rep¬ 
resenting  the  top.  This  determines  the  target  aspect  in  a  plane  containing  the 
missile  and  the  target.  The  aspect  angle  is  measured  from  the  vertical  to  the  line 
of  sight  as  in  figure  2-3.  Finally,  this  computation  is  important  in 

determining  the  other  two  rotation  angles  for  tne  remainder  of  the  target  aspect 
determination. 

2.3.6  Development  of  j-D  Target  Aspect  Determination 

The  development  of  three-dimensional  target  aspect  determination  is  presented 
in  this  section.  The  aspect  which  a  ground  target  presents  to  an  incoming  missile 
is  a  function  of  tne  missile  trajectory,  the  position  of  the  missile  in  that  tra¬ 
jectory,  and  the  relative  orientations  of  the  ground  target  about  three  axes  of 
rotation.  To  simplify  tne  aspect  discussion,  consioer  the  missile  fixed  in  space 
at  some  arbitrary  point  and  the  ground  target  free  to  rotate  completely  about  a  set 
of  mutually  perpendicular ,  body  centered  axes  wnich  are  fixed  at  the  centroid  of 
the  target.  By  allowing  the  target  to  have  complete  rotation,  not  constrained  by 
the  ground,  any  target  aspect  can  be  reproduced  with  respect  to  the  missile  fixed 
in  space.  Figure  2-4  shows  the  three  mutually  perpendicular  axes  of  rotation.  The 
X  and  y 

axes  are  in  a  plane  containing  them  and  the  centroid.  The  z  axis  is  positive  out¬ 
ward.  Rotation  about  the  x  axis  produces  the  side  versus  top  view  discussed  in  the 
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Figure  2.4  Mutually  Perpendicular  Axes  of  Rotation  Fixed  in  the  Target, 

previous  section  and  seen  in  Figure  2-5.  Tite  second  rotation  is  about  the  y  axis 
and  produces  a  side-end  composite  view  as  shown  in  Figure  2-6.  The  third  rotation 
is  about  tlie  z  axis  and  produces  a  view  wtiich  shews  the  target  on  an  incline  as 
depicted  in  Figure  2-7. 

From  the  discussion  of  Section  2.3-6,  the  angle  of  rotation,  o<  ,  about  the  x 
axis  has  been  computed  from  the  missile  trajectory,  sensor  resolution,  ana  target 
classification.  Consider  the  computation  of  ^  for  the  case  of  a  rotation  about 
the  y  axis  as  shown  in  Figure  2-6.  Assume  the  target  shown  in  Figure  2-6  is  a  tank 
which,  referring  to  the  Fourth  Quarterly  Report  on  page  2-13,  is  six  meters  long, 
2.3  meters  wide  and  2.4  meters  high.  The  tank  dimensions  along  the  x  axis  can  vary 
from  a  minimum  of  2.3  meters  for  an  end  view  to  6.5  meters  for  a  siae  view  with 
45*^  rotation.  There  is  a  relation  between  the  measured  x  dimension  and  the 
amount  of  rotation  in  the  angle  beta  about  the  y  axis.  However,  there  are 
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ambiguities  in  the  relationship  which  are  discussec  by  considering  the  rotation 
siiown  in  Figure  2-8.  There,  two  ambiguities  arise  from  only  knowing  the  x 
dimension.  The  angle  of  rotation  can  be  45*^  or  135*^,  and  the  front  and  rear  of 
the  target  are  also  unknown.  Under  the  assumption  of  a  massive  armor  attack,  as 
discussed  in  the  Fourth  Quarterly  Report,  trie  tanks  will  be  moving  rapidly.  Hence, 
the  direction  of  motion,  as  determined  by  the  binary  correlation  tracker  and  the 
change  detection  algorithms  described  in  the  Tnird  Quarterly  Report,  can  be  used  to 
define  the  front  ana  rear  of  the  tank.  Further  analysis  can  be  divided  into  an 
examination  of  interior  snapes  and  exterior  con-  nected  edges  associated  with  the 
velocity  vector. 

To  resolve  tne  remaining  ambiguity,  consider  tne  approach  of  using  the  cuer  to 
segment  interior  tank  features  as  the  range  closes.  At  this  point  in 


Figure  2-5.  Rotation  about  the  x  Axis. 
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tne  analysis  it  is  important  to  discuss  the  problem  definition.  Tne  proposed  ap¬ 
proach  is  not  to  extract  and  classify  a  large  set  of  features  such  as  turrets,  gun 
barrels  and  so  on  and  then  classify  the  target.  Rattier,  tne  approach  is  to  extract 

a  small  set  of  shapes  wnich  are  known  to  exist  on  particular  portions  of  the  target 

that  has  already  been  classified.  This  is  a  significantly  smaller  and  more  tract¬ 
able  problem  and  will  owe  its  success  to  tne  synergistic  effect  of  combining  a  cuer 

and  tracker.  The  set  of  interior  shapes  should  be  large  enough  so  that  they  can  be 

represented  by  a  significant  number  of  lines  even  at  longer  ranges.  They  should 
exhibit  fairly  high  si gnal -to-noise  ratios  so  that  segmentation  is  straightforward. 
Finally,  they  should  have  a  geometric  position  on  the  tank  so  that  their  detection 
would  resolve  the  ambiguity  shown  in  Figure  2-8.  The  engine  hot  spot  is  one  such 
feature.  It  appears  at  the  rear  of  the  tank,  can  be  represented  by  as  many  pixels 
as  the  width  of  the  tank,  and  has  a  signal-to-noise  ratio  that  can  be  15  gray 
levels  higher  than  the  remainder  of  the  tank.  Additional  features  are  the  tank 
treads  as  seen  in  end  or  quartering  views.  The  treads  can  be  one-quarter  of  the 
width  of  the  tank.  A  pair  of  treads  on  one  side  of  a  tank  silhouette  indicates  an 
end  on  that  side,  and  the  signal-to-noise  ratio  can  be  seven  gray  levels  higher 
than  the  remainder  of  tne  tank,  excluding  the  engine  hot  spot.  Using  these 
features  to  resolve  the  ambiguity  of  Figure  2-8,  the  hot  spot  would  be  more  ap¬ 
parent  in  the  45^  rotation,  and  it  might  even  obscure  the  treads  which  would  ap¬ 
pear  on  the  left  end  of  that  target.  The  treads  would  be  more  apparent  on  the 
right  end  of  the  tank  silhouette  produced  by  the  135°  rotation.  In  fact,  in  both 
cases  tne  distance  between  treads  is  a  rough  measure  or  the  amount  of  rotation. 

This  approach  requires  a  set  of  such  features  stored  for  each  target  type. 
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A  compl ementary  approach  to  resolve  the  remaining  ambiguity  is  further  analysis 
baseo  on  motion.  A  tracked  vehicle  such  as  a  tank  travels  in  the  direction  of  its 
longitudinal  axis.  Tiie  longitudinal  axis  of  a  tank  is  parallel  to  its  sides.  Sup¬ 
pose  tne  segmentation  of  a  tank  is  composea  ot  a  set  of  connected  edges  for  which 
the  orientation  angles  in  the  image  plane  have  been  computea.  From  motion 
analysis,  the  orientation  of  tne  velocity  vector  in  the  image  plane  has  been 
computed.  By  comparing  the  orientation  angles  of  the  velocity  vector  with  those  of 
the  connected  edges  describing  the  target  exterior,  that  set  of  target  edges  which 
approximate  the  velocity  vector  can  be  labelled  as  the  side  of  the  tank.  Then  the 
remainder  of  the  edges  plus  the  orientation  of  the  principal  axes  may  be  used  to 
"flesh  out"  the  rest  of  the  tank. 

The  final  angle  of  rotation  for  target  aspect  determination  is  the  angleXabout 
the  outward  i  axis,  as  shown  in  Figure  2-7. 

X  Can  be  airectly  calculated  in  the  image  plane  by  computing  the  orientation 
angle  of  the  lower  or  bottom  edge  defining  the  target  shape.  The  procedure  for 
computing  orientation  angles  for  edges  was  presenteo  in  the  First  Quarterly  Report. 
2. 3. 6.1  An  Example  of  Target  Aspect  Determination 

In  the  above  section,  the  determination  of  each  angle  of  rotation  was  consi¬ 
dered  separately.  The  angle  oeterminations  are  now  considereo  together  in  an 
example.  In  ronsidering  the  motion  of  a  tank,  for  example,  it  is  noted  that  the 
longitudinal  axis  of  the  tank  is  generally  parallel  to  the  direction  of  motion. 
Further,  the  extracted  lower  edges  describing  the  sides  are  parallel  both  to  the 
motion  and  the  longitudinal  axis.  Hence,  as  an  a  first  approximation  to  the 
problem  of  deciding  which  rotation  angle  to  determine  first,  the  direction  of 
motion  is  found.  Then  that  target  lower  edge  which  is  parallel  to  the  direction  of 
motion  describes  the  siae,  as  shown  in  Figure  2-9.  A  rough  approximation  to  the 
end  of  the  target  can  now  be  attempted.  Important  features  are  a  pair  of  vertices 
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snuwn  it  points  A  and  B  (  ^J)  of  Figurs’  2-9.  Having  detected  these 

vcrt1L^■s.  as  shoun  in  Figure  2-9  by  dashed  line-  ,  the  remainoer  then  can  be 
completed,  shown  by  solid  lines.  From  the  87b-line  FLIR  data  supplied  by  NV&EOL, 
it  appears  tnet  information  of  this  type  is  discernable  on  target  of  12  lines 
high.  Now  the  angles  of  rotation  can  be  computed.  The  direction  of  motion  and 
i dent  it icat ion  of  tne  lower,  parallel  edge  suggest  whicn  side  (left  or  right)  will 
contain  the  end  view  ana  the  A  and  B  vertices. 


Figure  2-9.  Motion-Edge  Orientation  Angles 
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Figure  2-10.  End  Approximation 


2.4  AN  OVERVIEW  OF  AIMPOINT  SELECTION 

The  purpose  of  this  section  is  to  describe  aimpoint  selection  and  homing  from 
detection  range  to  impact  using  the  approach  described  in  the  Fourth  Quarterly 
Report  and  prior  sections  of  this  report.  Previously,  various  parts  of  the  problem 
have  been  considered  and  proposed  solutions  discussed.  Each  paragraph  below 
represents  a  significant  range  mark  in  the  clos'ng  trajectory. 

2.4.1  Detection  Range 

Detection  range  for  the  imaginary  FLIR  sense-'  described  in  the  Fourth  Quarterly 
Report  was  2700  meters.  Steering  should  be  initiated  at  these  ranges  upon  the 
detection  of  a  target  or  group  of  targets  in  orcer  to  keep  them  within  the  field 
of-view.  A  particular  formation  of  targets,  the  direction  of  motion,  and  hot  spots 
in  the  rear  portion  (determined  from  motion)  of  the  target  will  add  credibility  to 
a  turn  initiated  at  detection  range  and  beyond  classification  range. 
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At  a  classification  range  of  l&OO  meters  for  the  imaginary  FLlR  sensor,  target 
aspect  ii  computed  on  the  classified  targets.  For  the  tank,  the  silhouette  is  8  to 
10  lines  high  and  between  10  to  20  pixels  wide.  Assume  that  the  exact  width  of  tne 
target  is  not  known.  The  uncertainty  is  caused  by  gradient  descriptions  which  are 
ramp  functions,  and  segmentation  processes  which  tend  to  use  edge  values  based  on 
the  steepest  part  of  the  gradient  curve.  At  1600  meters,  the  target  subtends  about 
1.5  milliradians  in  the  vertical  dimension  and  3.0  milliradians  in  the  horizontal 
dimension.  The  maximum  turn  rate  for  a  3g  missile  is  3.4°  per  second  or  186  mil¬ 
liradians  per  second  so  the  missile  can  be  easily  steered  to  any  aimpoint  on  the 
target.  At  classification  range,  the  expected  performance  capability  allows 
placing  the  aimpoint  within  one  or  two  pixels  of  the  correct  location.  This  is 
satisfactory  because  it  means  that  subsequent  corrections  will  be  small. 

2.4.3  One-Half  of  the  Classification  Range 

At  one-half  the  classification  range,  or  800  meters,  the  target  is  16  to  20 
lines  nigh  and  20  to  40  pixels  wide.  The  signal-to-noise  ratio  has  mproved  by 
2-3db  and  tne  target  edge  is  tending  more  toward  a  step  function  than  a  ramp. 
Consequently,  the  exact  target  outline  is  better  ';nown.  As  seen  from  the  Fourth 
Quarterly  Report  (p.  4-11),  the  aimpoint  is  fluctuating  approximately  +  1  pixels 
about  a  center  point.  The  target  encompasses  3  milliradians  in  the  vertical 
dimension  and  6  milliradians  in  the  horizontal  dimension.  The  resolution  is  0.3 
meters  per  pixel.  This  translates  into  an  aimpoi  it  jitter  of  +  0.3  meters  and  an 
outline  uncertainty  of  perhaps  10%  or  0.32  meters.  The  aimpoint  is  settling,  and 
becoming  more  accurate  based  on  an  improved  exterior  edge  description. The  target  is 
cued  every  fifth  frame  and  tracked  with  a  bandpass  binary  correlation  tracker  in 
the  intervening  four  frames.  The  aimpoint  is  based  on  the  binary  target  outline 
formeo  in  the  inner  track  window. 
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^  2.4.4  One-Qudrte»"  of  tne  Classification  Range 

,  At  one-quarter  of  the  classification  range,  nr  400  meters,  the  missile  has 

I  entered  the  guidance  settling  period.  Only  small  cnanges,  if  any,  are  permitted  in 

the  missile  steering  and  the  aimpoint  selection  process  should  be  completed.  The 
target  has  doubled  again  in  size  and  in  number  of  lines,  and  the  target  outline  is 
oetter  defined.  The  tracking  has  switched  over  from  correlation  tracking  to  edge 
tracKing  as  descrioed  in  the  Fourth  Quarterly  Report.  The  discrepancy  in  handover 
was  approximately  one  pixel.  At  tnis  range  the  target  is  32-40  lines  high  and 
40-80  lines  wide.  Each  line  represents  0.15  meters  and  the  handover  from  cor¬ 
relation  to  edge  tracking  is  good  to  0.15  meters.  Further,  as  was  also  shown  in 
Che  Fourth  Quarterly  Report,  final  tracking  based  on  target  edges  for  64x64  pixel 
targets  is  also  accurate  to  1  pixel  or  0.15  meters  and  was  based  on  25  edges. 

Since  the  guidance  settling  period  has  started  the  missile  cannot  switch  to  another 
target  in  the  field  of  view;  tracking  is  now  confined  to  the  primary  target,  and 
the  remainder  of  the  field  of  view  can  be  ignored.  The  frame  rate  capability  of 
the  edge-baseo  tracker  is  increased  because  of  the  smaller  field  of  .lew.  This 
does  not  imply  that  classification  decision  should  be  held  off  until  just  before 
1/4  classification  range. 

Tne  results  of  Section  2.4  ana  missile  dynamics  will  be  considered  in  the  next 
quarter . 
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3.0  PRELIMINARY  RESULTS 

Tnis  section  presents  on  analyses  of  two  image  processing  problems  alluded  to 
in  the  previous  section.  The  first,  discussed  in  Sections  3.1  and  3.2,  concerns 
target  aspect  determination.  The  second,  described  in  Section  3.3,  concern  closein 
homing  (>24  lines)  and  tlie  registration  accuracy  under  conditions  of  emerging  in¬ 
terior  features,  target  movement  and  scale  manges. 

3.1  ASPECT  DETERMINATION  FOR  A  MANEUVERING  TARGET 

Tee  purpose  of  tnis  paragraph  is  to  consider  aspect  determination  for  a  maneu¬ 
vering  target;  i.e.,  the  target  aspect  changes  with  time.  Further,  the  target  will 
nave  portions  of  the  end,  side,  and  top  visible  The  target  motion  and  orientation 
angles  of  the  target's  edges  (in  the  image  plane)  are  used  to  determine  aspect. 
Since  tne  ciata  used  is  extensive,  tne  section  is  divided  into  several  parts.  The 
first  part  describes  the  input  data  and  the  second  describes  the  analysis.  The 
third  part  provides  an  additional  example 

3.1.1  Input  Data 

Tne  data  is  from  the  875  line  FLIR  ciata  base  supplied  by  NV&EOL  .  It  is  found 
at  position  5080  on  video  tape  "#5  11/15/77",  described  in  the  First  Quarterly 
Report.  A  tank,  initially  moving  un  a  road,  moves  off  onto  the  shoulder  as  shown 
in  Figure  3-1.  Another  vehicle,  a  jeep,  is  moving  ahead  of  the  tank  and  stays  on 
tne  r>.  ad  The  tank  passes  the  jeep  in  tne  last  p.irt  of  the  sequerice.  Tne  heli¬ 
copter  appears  to  be  hovering  with  an  initial  declination  angle  (between  horizontal 
ana  tne  tanx)  of  pernaps  30^  As  tne  tank  approaches  the  hovering  helicopter  the 
declination  angle  increases  to  an  estimated  80°.  The  portion  of  interest  is 
Shown  in  tne  ellipse  formed  tiy  the  dashed  lines  in  Figure  3-1. 
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Tnir  Intt- i  1  i gent  Tracker  is  cueing  every  riftt  frame  and  tracking  targets  on  the 
intervening  four  images  with  a  binary  correlation  tracker.  This  is  part  of  the 
system  concept  for  tne  Intelligent  Tracker  aescribed  in  Section  1.0.  The  oata  used 
for  the  analysis  starts  at  the  778th  frame  from  the  oeginning  of  the  sequence; 
every  ourth  frame  was  digitized  because  of  the  long  length  of  the  sequence. 
Further,  a  single  field  was  digitized  from  each  frame  to  reduce  the  amount  of  data 

hundicC  Aspect  determination  is  analyzed  lor  images  778.  782,  786  ...  .  838. 

where  images  778,  798,  318,  and  838  are  cued  and  the  other  images  are  tracked. 

Figure  .'i-za  shows  tne  segmentation  for  the  tank  in  image  778;  the  outline  of 

the  target  is  constructed  in  Figure  3-2b.  The  outline  was  liand  drawn  in  this  case 

but  tnere  are  algontfims  available  for  linking  edges,  one  of  which  was  describee  in 
the  First  (Quarterly  Report  ,  Superlink^  is  another  edge  linking  algorithm  based 
on  the  segmentation  format  presented  by  Superslice,  the  segmentation  algorithm  useu 
extensively  in  the  present  work.  Tne  track  windows  are  shown  in  Figure  3-3,  4,  ... 
1 .  Figure  3-3  is  the  track  reference  window  for  image  778,  and  Figu'-es  3-4,  5,  6. 

7  are  the  ti'.a.ik  windows  for  images  782,  786,  ...  794.  The  calculations  under  each 


4.- 


igiCp  3-23.  Segmented  Tank 


Figure  3-2b.  Tank  Outline 
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t.rdCK  ft'uftio  were  discussed  in  uttail  in  the  Fi.ur  n  Qi.drterly  Report  una  are  oriofly 


reviewed  here.  The  quantities  X8AR  ■■'d  YBAR  are  the  coordinates  of  the  target 

cenuroia  as  measured  from  trie  upper  lefthand  corner  of  the  track  window  with  the 

I  '  +x 

convention  .  The  quantities  SIG2X,  S1G2Y,  and  51GXY  are  the  second 

moments  (statistical  variances}  of  the  target,  and  fSIG2X  and  /sIg2Y,  stanriarc 
deviations,  are  a  rough  measure  of  the  target  X  and  Y  aimen-  sions  in  the  image 
plane  cooro  inates .  Theta  is  the  orientation  angle  ut  tne  prin-  cipal  axis  as 
measured  around  the  centroio.  In  tnis  case,  theta  measures  the  ori-  entation  of 
the  principa'  X  axis  from  the  X  axis  in  the  image  plane  as  seen  in  Figure  3-8.  Tne 
quantities  SZX  and  SZY  measure  the  X  and  Y  target  dimensions  in  the  directions  of 
tne  principal  axes.  The  quantities  D£LX,  DElY,  and  DIREt  measure  the  movement  of 
the  track  window  from  frame  to  frame  in  the  coordinate  system  of  the  image  plane. 

As  an  aside,  a  technical  point  regarding  data  presentation  must  be  made.  Note 
that  the  target  images  in  the  track  window  are  slightly  smaller  than  those  of  the 
segmentation  image.  Tne  target  was  close  to  the  bottom  of  ihe  digitized  window,  so 
a  small  track  window  had  to  be  used.  Hence,  the  Superslice  threshold  was  increased 
one  level  tc  p'^esent  a  smaller  target  and  facilitate  a  smaller  window.  The  effect 
on  the  results  was  not  significant  as  the  principal  axes  orientation  angles  remain¬ 
ed  approximately  tne  same.  This  problem  is  artiiicial.  In  obtaining  this  test 
data,  tiie  sensor  operator  did  not  center  the  tanet  as  the  cuer  would  have  done. 

to  Complete  tne  discussion  of  the  initial  data,  note  Figure  3-)),  which  is  the 
12S  X  128  pixel  window  grabbed  from  image  778.  Mgure  3-9  shows  the  target  for 
im.ige  7'j8  np  igh  ly  drawn  in  to  show  tne  amount  of  target  movement  rroin  image  7  78  to 
image  /9o.  Consider  tne  method  used  to  obtain  the  amount  of  target  movement  from 
image  7  78  to  7'^8. 
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Figure  3-8.  Principal  Axis  Orientation 

3. 1.1.1  Determination  of  Target  Movement 

This  description  is  included  under  the  Input  Data  paragraph  because  change  de¬ 
tection  has  been  discussed  in  detail  in  the  Second  and  Third  Quarterly  Reports. 
However  ■  a  variation  of  chose  techniques  is  applied  (lere  which  appears  to  be  an 
improvement.  Figure  3-10  shows  the  segmented  target  and  portions  of  the  background 
for  image  778.  Figure  3-11  is  the  counterpart  for  image  798  Note  that  the 
numbers  under  the  image  provide  statistics  for  each  seg-  mented  object  shown  in 
each  image.  The  BLOB  statistic  describes  which  object  (blob)  is  being  described  by 
the  particular  row  of  statistics.  AVG  is  the  average  gray  level  of  the  object. 

The  quantities  LFTX  (Left  x),  TORY  (Top  y),  RGTX  (Right  x),  and  BOTY  (Bottom  y) 
descrioe  the  X  and  Y  coordinates  at  the  extremes  of  each  object.  This  was 
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developed  to  provide  perspective  information  for  target  signature  prediction  and  is 
described  in  detail  in  tiie  Second  Quarterly  Report,  page  4-9.  The  data  tends  to 
indicate  the  distance  between  the  target  and  obscuration  and  whether  it  will  pass 
in  front  of  or  behind  the  obscuration.  For  example,  BOTYPBOTYi^  means  that 
target  1  is  below  target  2  in  the  image.  When  they  cross,  target  1  will  be  in 
front.  The  quantities  XBAR  and  YBAR  are  tne  X  Y  coordinates  of  the  objects'  cen 
troid.  DELX  and  DELY  are  the  X  and  Y  dimensions  of  the  object  and  SIZE  indicates 
target  area  in  pixels.  PERPTS  is  the  number  of  perimeter  points  (object  pixels 
having  at  least  one  neighboring  pixel  at  the  zero  gray  level)  in  the  object.  Sup 
pose  tne  objects  are  renamed  T  for  target.  Area  2  for  that  portion  of  background  in 
the  lower  part  of  the  window,  and  Area  1  for  that  portion  of  the  background  in  the 


right  side  of  the 

image  and  above 

the  target. 

The  new  nomenclature  is  shown  in 

Figure  3-12,  page 

3-7.  Figure  3- 

13  shows  the  X 

,Y  coordinates  (XBAR,  YBAR)  for  the 

centroids  of  tne 

target,  AREA  1, 

and  AREA  2  for 

both 

images. 

T  arget 

Area 

1 

Area  2 

IMAGE 

XBAR  YBAR 

XBAR 

YBAR 

XBAR  YBAR 

778 

52  1  16 

107 

91 

69  123 

798 

43  110 

105 

82 

69  117 

X 

-09 

-02 

0 

y 

-06 

-09 

-06 

Figure  3-13.  Object  characteristics  for  Images  778  and  798 
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The  amount  of  change  (Ax, Ay)  in  XBAR  and  YBAR  for  each  object  is  shown 
in  Figure  3-13.  The  target  movto  nine  pixels  to  the  left  and  six  pixels  up; 
AREA  1  moved  two  pixels  to  the  left  and  nine  pixels  up;  and  AREA  2  remained 
stationary  in  x  ano  moved  six  pixels  up.  The  changes  indicate  that  the  ob¬ 
jects,  as  a  collection,  moved  to  the  left  and  up  in  the  coordinate  system  of 

the  image  plane  which  is  positive  to  the  right  in  x  ano  positive  down  in  y, 

rx+ 

,  as  measured  from  the  upper  left  hand  corner  of  the  image.  The 
amount  of  change  indicates  tiie  range  over  •.-.'hich  the  maximum  correlation  for 
binary  change  detection  will  be  obtained.  For  example,  the  maximum  correla¬ 
tion  in  X  should  occur  in  a  neighborhood  about  the  interval  -09  to  0;  the 
maximum  correlation  in  y  should  occur  in  a  neighborhood  about  the  inteval  -06 
to  -09  Hence,  there  is  no  need  to  search  an  interval,  e. g.  of  -32  to  +32 
pixels  in  x,  and  -32  to  +32  pixels  in  y.  If  a  neighborhood  of  +  5  pixels 
were  chosen  about  tne  changes,  the  range  of  possible  correlations  would  be 
-14  to  +5  in  X  and  -01  to  -14  in  y.  This  reduces  the  number  of  correlation 
computations  for  binary  change  detection  from  4096  (64x64)  to  285  (19x15). 
Second,  a  window  encompassing  AREA  1  and  AREA  2  and  not  the  target  is  useo 
for  change  detection  to  obtain  scene  movement  caused  by  sensor  motion. 

Figure  3-14  shows  the  computer  printout  from  the  binary  change  detection  for 
a  64x64  window  containing  AREAS  1  and  2.  The  windows  were  matched  oest  at 
-1,  -9  as  shown  in  Figure  3-14.  This  means  that  AREAS  1  and  2  for  image  798 
were  one  pixel  to  the  left  and  nine  pixels  above  (-1,-9)  the  locations  of 
AREAS  1  and  2  for  image  778.  The  target  position  for  image  798  was  nine 
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Figure  3-14.  Binary  Change  Detection  Printout 


pixels  to  the  left  and  six  above  the  target  location  for  image  778.  But  this 
motion  included  target  motion  plus  sensor  motion.  Tne  former  (-1,-9)  displacement 
is  for  sensor  motion  only.  Hence,  sub-  tracting  the  two,  (-1,-9)  (-9,-6),  gives 
(-8, +3)  as  target  displacement  alone  This  is  to  tne  left  ana  aown. 

In  conclusion,  the  magnitude  of  the  binary  change  detection  computation  used 
here  was  limited  in  window  size  to  a  64x64  (encompassing  AREA  1  and  AREA  2)  ana  in 
correlation  range  by  using  information  already  developed  about  the  images. 

Further,  the  x,  y  information  developed  for  AREA  1  and  AREA  2  is  a  rough  early 
indication  of  the  change  detection  results. 

3.1.2  Analysis 

The  input  data  for  target  aspect  determination  has  been  obtained.  The  target 
has  been  segmented,  and  an  edge  outline  has  been  completeo.  The  target  dimensions 
about  the  principal  axes,  the  orientation  angles  of  the  principal  axes  and  target 
edge  outlines  in  the  image  plane,  and  direction  of  motion  have  been  obtained.  Note 
that  the  only  supplemental  algorithm  necessary  is  the  edge  orientation  algorithm, 
one  form  of  which  we  described  in  the  Fourth  Quarterly  Report  and  which  is  used  in 
close-in  homing.  Hence,  we  are  not  introducing  any  new  algorithms  for  target 
aspect  determination  but  rather  using  only  available  information.  Figure  3-15 
shows  the  orientation  angles  in  the  image  plane  coordinates  (x.j,  y\j),  for  the 
case  at  hand.  The  orientation  angle  of  the  principle  x  axis,  xp,  is  -28°, 

-23°,  -44°,  -33°,  and  -25°  from  the  track  windows  shown  in  Figures  3-3 
through  3-7.  This  is  represented  at  &  =  30°  in  Figure  3-15.  Further,  the  total 
length  of  the  target  along  the  principal  x  and  y  axes  is  5  and  10  pixels,  respec¬ 
tively,  which  falls  within  the  range  of  aspect  ratios  for  tanks.  The  results  of 
the  change  detection  record  show  that  the  target  movement  from  image  778  to  image 
798  is  approximately  -8  and  +3  in  x.  and  y.. 
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Figure  3-15.  Orientation  Angles  in  the  Image  Plane 


Figure  3-16  shows  the  movement.  Hence,  the  orientation  angle  of  the  target  veloc¬ 
ity  vector  with  respect  to  the  +y.j  axis  is  -69*^.  Compare  the  orientation 
angles  of  the  velocity  vector  cind  principal  axes  orientation  with  those  of  the 
target  edges  shown  in  Figure  3-2b  and  redrawn  in  Figure  3-15.  To  put  the  angles  in 
a  common  coordinate  system,  we  measure  the  angles  from  the  horizontal  and  in  a 
counterclockwise  direction.  The  lower  slanted  edge  of  the  target  has  an  orien¬ 
tation  of  approximately  30*^  ano  the  target  velocity  vector  has  an  orientation  of 
21°  in  tiie  same  system.  The  upper  slanted  target  eoge  has  an  orientation  of 
56°  and  the  princip-'l  y  axis  has  an  orientation  of  60°  in  the  same  system. 

Hence,  a  number  of  conclusions  can  be  drawn  about  the  target,  as  follows. 

The  sides  of  the  tank  are  generally  oriented  to  align  with  the  direction  of 
motion  of  the  tank  in  straight  motion.  Hence,  that  edge  most  closely  aligned  with 
the  velocity  vector  (edge  3)  probably  represents  the  side  of  a  tank.  The  lower 
part  of  the  tank  is  the  front  of  the  tank  because  it  is  in  the  direction  of  motion; 
the  upper  part  of  the  tank  is  directed 
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Figure  3-16.  Target  Movement  Direction 


opposite!}'.  Tne  relative  orientations  of  edges  1,  2,  and  3  plus  the  suppositions 
that  edge  3  is  the  side  of  the  tank  and  the  lower  part  of  the  tank  is  the  front, 
allows  the  construction  of  tne  dotted  lines  as  shown  in  Figure  3-17  which  complete 
the  orientation  for  the  oblique  view  presented.  The  orientation  of  edge  4  is  not 
aligned  with  that  of  edge  3  because  it  must  account  fur  the  presence  of  the  tur¬ 
ret.  Implementation  will  be  considered  in  the  Final  Report. 

Having  presented  data  and  analyzed  target  aspect  for  images  778  through  798, 
consider  the  techniques  just  developed  as  applied  to  798  versus  818. 

3.1.3  Images  798  versus  818 

The  scenario  just  described  is  now  analyzed  over  the  next  set  of  tracked 
images:  802,  806,  810,  814  and  the  cued  image  818.  This  is  a  particularly  inter¬ 
esting  set  because  the  target  is  turning  and  changing  aspect,  in  the  presence  of 
clutter  or  a  potential  obscuration.  A  conflict  can  arise  between  obtaining  a 
complete  target  outline  for  aimpoint  purposes  with  a  higher  risk  of  track  loss,  and 
obtaining  a  less  descriptive  target  outline  and  a  lower  risk  of  track  loss. 
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Figure  3-17.  Oblique  View  of  Tank 


The  purpose  of  this  section  is  to  examine  the  problem  of  target  aspect  determi¬ 
nation  for  a  maneuvering  target  in  the  presence  of  potential  obscurations.  First, 
the  problem  is  described  in  the  context  of  the  binary  bandpass  tracker.  Second, 
images  802,  806,  810,  and  814  are  presented  and  the  problem  described  in  terms  of 
the  data.  Thiro.  the  proposed  approach  to  aspect  determination  is  described.  This 
approach  is  the  same  as  that  described  in  the  previous  section.  Fourth,  the 
proposed  approach  is  applied  to  the  data  and  the  results  described.  Finally,  an 
amendment  to  the  intelligent  tracker  system  concept  is  described  which  ties  to¬ 
gether  two  parts  of  the  concept  whose  relationship  was  brought  out  by  this  test 
data.  The  relationship  has  to  do  with  the  handover  between  the  cuer  and  tracker. 
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Problem  description  -  In  the  previous  intelligent  tracker  work,  emphasis  has 
been  placeo  on  track  continuity  by  using  a  cuer  coupleo  with  a  bandpass  oinary  cor¬ 
relation  tracker.  The  tracker  consists  of  an  interior  and  exterior  window  as  shown 
in  Figure  3-18.  The  interior  window  is  placed  over  the  cued  target  such  that  90 
percent  of  the  target  is  within  the  interior  window.  An  exterior  window  is  select¬ 
ed  ana  centered  on  the  interior  window,  which  is  double  the  height  and  width  of  the 
interior  window.  A  histogram  of  the  exterior  window  represents  the  background.  A 
discriminant  function  is  constructed,  (e.g.  Neyman-Pearson) ,  which  maximizes  the 
metric  (distance)  between  the  two  histograms.  Essentially  this  has  the  effect  of 
eliminating  gray  levels  which  are  common  and  have  substantial  representation  in  the 
background  histogram.  For  pixels  in  the  exterior  window  at  the  same  level  as  the 
target,  these  procedures  eliminate  those  levels  from  the  target.  The  result  is  a 
less  than  accurate  description  of  the  target,  a  less  accurate  estimate  of  target 
aspect,  and  an  erroneously  placed  aimpoint. 


Figure  3-18.  Track  lindow  Arrangement 
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Data  presentation  -  The  cued  target  for  frame  798  is  shown  in  Figure  3-19.  The 
target  was  thresholded  at  39  for  a  maximum  edge/perimeter  match.  The  correspond¬ 
ing  binary  reference  window  is  shown  in  Figure  3-2U  and  has  been  thresholded  by  the 
tracker  at  40.  The  reason  for  this  is  shown  in  Figure  3-21,  which  is  a  composite 
of  the  interior  and  exterior  windows.  Figure  3-21  shows  the  target  thresholded  at 
39  and  40;  it  also  shows  the  presence  of  background  at  the  39  level  in  the  lower 
right  hand  corner  of  the  exterior  window.  This  is  the  reason  for  the  new  thres¬ 
holding  by  the  tracker.  The  track  windows  for  images  802,  806,  810,  and  814  are 
shown  in  Figures  3-22,  23,  24,  and  25.  Recall,  the  target  aspect  determination  ap¬ 
proach  described  in  3.1.2.  Edge  orientation,  motion,  and  principal  axis  orienta¬ 
tion  are  used  primarily  to  determine  aspect.  Figures  3-26,  27,  28,  and  29  show  the 
target 


Figure  3-19.  Cued  Target  Image  798 
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3-20.  Binary  Reference  Uindow  Imaoe  798 
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Sector  1  Histogram 
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Figure  3-30.  Histograms 
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at  a  threshold,  of  40  for  the  same  set  of  images.  There  is  a  marked  difference 
between  the  two  sets  of  images.  Note  in  particular,  the  orientation  angle  and 
extent  of  the  bottom  horizontal  edge.  The  first  set  of  images  indicates  that  the 
target  is  roughly  going  in  the  same  direction  and  orientation,  while  the  second  set 
indicates  that  the  vehicle  is  turning  and  changing  aspect. 

Proposed  approach  -  To  solve  the  above  problem,  we  make  use  of  the  tools 
already  constructed  for  the  intelligent  tracker  and  described  in  the  Third 
Quarterly  Report.  The  direction  of  motion  of  tfie  target  nas  been  determined  as 
described  in  the  previous  section.  Based  on  this  motion,  histograms  are  construc¬ 
ted  ahead  of  the  target  at  approximately  ttie  same  height  (line)  as  the  target.  The 
histograms  are  shown  in  Figure  3-30  for  image  798,  and  are 

used  to  predict  target  obscurations.  Note  that  these  histograms  do  not  predict  ob¬ 
scurations  at  threshold  39  in  the  direction  of  motion.  Hence,  the  correlation 
tracker  can  be  overridden  and  the  cued  threshold  used  for  the  tracker.  This  rela¬ 
tionship  between  the  target  signature  histograms  and  the  exterior  track  window  was 
Drought  into  focus  with  this  set  of  data. 

Target  aspect  determination  -  The  target  aspect  determination  techniques 
described  in  the  previous  sections  are  now  applied  to  tne  images  of  Figure  3-26 
through  3-29  (the  images  segmented  by  the  cuer).  The  bottom  edge  has  lengthened 
substantially,  and  the  30°  edge  associated  with  the  motion  nas  become  a  near 
horizontal  edge.  The  motion  between  images  798  and  818  shown  in  Figures  3-31,36  is 
now  discussed.  A  portion  of  tne  segmented  background  for  image  798  is  shown  in 
Figure  3-32  and  the  same  section  is  shown  in  Figure  3-33  for  image  818.  A  print 
out  of  the  matched  registered  binary  windows  and  the  registration  results  are  shown 
in  Figure  3-34.  The  changes  between  the  two  windows  are  shown  in  Figure  3-35. 


-3-25- 


BLOB 

AVG 

LFTX 

1 

38.70 

89 

1 

39.45 

62 

BLOB 

DELX 

DELY 

1 

38 

34 

1 

21 

24 

TORY 

RGTX 

BOTY 

30 

126 

63 

66 

88 

89 

SIZE 

PERPTS 

777 

191 

417 

117 

Figure  3-33.  Image  818 
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Figure  3-34  shows  that  the  movement  between  images  is  (-2,  -33),  i.e.  image  818 
moves  two  pixel  to  tne  left  ano  33  up  from  image  798.  The  target  movement  is  shown 
in  Figure  3-34  and  includes  sensor  movement.  It  is  (-7,  -35),  hence,  the  net 
target  movement  is  (-5,  -2)  and  the  bulk  of  the  movement  corresponds  to  a  changed 
target  aspect. 
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OLD  WINDOW  CORNER  (OKSTR.OISTR) =  63 

CORNER  OF  WINDOW  MATCH ( KSAVE  ,  I  SAVE) =  61 


Figure  3-34.  Matched  Windows 


3.2  TRACKING  STABILITY  AND  INTERIOR  FEATURES 


In  the  Fourth  Quarterly  Report,  the  intelligent  tracker  homing  concept  was 
described.  It  consisted  of  correlation  tracking  on  the  target  exterior  at  long 
range  and  a  transition  to  edge  tracking  for  close-in  homing.  The  transition  took 
place  at  roughly  500  meters  from  impact  with  the  target  30  lines  high.  The  track¬ 
ing  accuracy  was  found  to  be  approximately  one  pixel  with  a  transition  (hand  over) 
accuracy  of  one  pixel,  also.  The  tracking  in  both  regimes  is  based  on  the  exterior 
target  shape.  In  this  paragraph,  the  analysis  is  taken  a  step  further,  and  track¬ 
ing  accuracy  is  examined  for  the  case  of  close-in  homing  and  emerging  interior 
features.  The  emerging  interior  features  will  change  the  target  signature.  Edges 
will  be  added  to  the  interior.  An  important  consideration  is  whether  these  added 
edges  cause  confusion  or  by  gradually  emerging  reinforce  the  exterior  edges  in  the 
registration  process.  If  the  latter  is  true,  then  the  exterior  edges  initially 
overwhelm  the  emerging  interior  edges.  Then  as  the  interior  edges  become  repeat- 
able  from  frame-to-frame,  they  will  contribute  positively  to  the  overall  regist¬ 
ration  process. 

Another  issue  in  the  analysis  is  the  tracking  accuracy  under  a  scale  change 
from  frame  to  frame.  Consider  the  geometry  of  the  situation.  The  missile  is  at  a 
range  of  500  meters  from  the  target  which  is  30  lines  high.  It  is  closing  at  a 
velocity  of  500  meters  per  second.  The  target  will  be  60  lines  high  at  250  meters 
from  impact,  which  will  occur  one-half  second  and  15  frames  later.  In  this  inter¬ 
val  the  target  height  is  growing  at  2  pixels  per  frame  (approximately  a  10%  change 
in  target  area  per  frame  depending  on  target  aspect);  closer  in,  the  target  will 
grow  faster.  This  is  the  scale  change  from  frame  to  frame. 
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Adaitional ly,  there  is  lateral  movement  from  frame  to  frame  caused  by  the 
motion  of  the  target  and  missile.  Assume  the  target  is  moving  perpendicularly  to 
the  field  of  view  at  30  miles  per  hour.  This  is  approximately  equivalent  to  a  two 
pixel  movement  per  frame  at  500  meters.  The  intelligent  tracker  algorithms  were 
tested  on  data  which  presented  target  growth  rates  of  two  to  six  pixels  per  frame 
and  with  18  pixels  target  movement  between  frames. 

The  test  data  selected  from  the  NV&EOt  875-line  FLIR  data  base  was  from  Tape  #5 
11/15/77,  tape  position  3180.  Figure  3-37  shows  the  scenario.  The  tank  starts 
from  the  left  side  of  the  road.  The  helicopter  is  at  long  range  initially.  The 
helicopter  flies  over  the  tank  when  it  is  between  two  ARC'S. 

The  processed  images  were  approximately  1200  frames  into  the  run  and  are  char¬ 
acterized  in  Figure  3-38.  Note  that  in  one  ..ase,  131b  versus  1330,  the  difference 
in  image  size  is  20  percent,  which  characterizes  the  target  growth  rate  at  ranges 
closer  than  250  meters.  The  target  grows  at  a  rate  of  8  pixels  per  frame  as  the 
range  closes  to  125  meters  and  with  one-quarter  of  a  second  to  impact. 


Figure  3-37.  Scenario 
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Figure  3-38.  linage  Charac  ter  i  st  ics 


The  images  are  shown  in  Figure  3-39  througn  3-43.  The  extracted  edges  for 
each  of  the  images  are  shown  in  Figures  3-41  through  3-48.  Note  that  the  aim- 
point  is  includea  in  Figures  3-39  through  3-43.  The  aimpoint  was  placed  on 
image  1267,  successive  registrations  were  then  placed  on  it  on  the  following 
windows.  The  successive  aimpoint  placement  was  achieved  by  registering  an  80 
pixel  by  bO  pixel  image  containing  the  target  on  tne  following  image  of  128 
pixels  by  128  pixels  in  size.  To  test  the  accuracy  of  the  registration 
results,  a  point  was  selected  on  each  tank  which  appears  repeatable.  In  this 
case,  the  point  selected  was  an  interior  feature  which  appeared  to  be  the 
"treau  wheel"  or  sprocket  wiiicn  was  approximately  2  pixels  by  2  pixels. 
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The  difference  between  the  sprocket  position  in  the  pairs  of  images  is  shown  in 


Figure  3-49.  The  registration  results  are  also  shown  for  the  image  pairs. 
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Figure  3-49.  Registration  Results 


In  conclusion,  the  results  are  in  the  same  accuracy  ronge  as  the  previous 
results  shown  in  the  Fourth  Quarterly  Report.  These  accuracies  in  pixels 
translate  into  linear  error  which  is  quite  small.  Tnere  is  approximately  a 
0.1  meter  per  pixel  at  these  ranges,  so  the  linear  error  is  of  the  order  of 
approximately  . 1 -. 2m. 
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4.0  HARDWARE  IMPLEMENTATION 


The  purpose  of  this  section  is  to  describe  the  implementation  of  the  intelli¬ 
gent  tracker  algorithms  in  very  large  scale  integrated  (VLSI)  and  very  high  speed 
integrated  (VH5IC)  circuit  hardware.  The  first  paragraphs,  "System  Constraints" 
and  "Hardware  Constraints",  serve  as  introductions  and  describe  the  approach 
selected.  The  third  paragraph,  "Feasibi 1 ity-An  Overview",  is  a  general  evaluation 
of  feasibility  and  it  describes  the  promise  of  VLSl/VHSIC  implementation  of  the  in¬ 
telligent  tracker  algorithms.  The  fourth  paragraph,  "Parallel  Implementation  of 
the  Segmentation  Algorithms",  offers  an  approach  to  segmenting  an  object  which  lies 
across  the  artificial  boundaries  created  by  parallel  implementation.  The  fifth 
paragraph,  "Superslice  -  A  Closer  Look",  examines  the  algorithm's  performance  as 
seen  in  the  intelligent  tracker  work  and  suggests  a  degree  of  implementation  rigor 
necessary  for  adequate  performance.  Also,  a  Superslice  implementation  is  sketched 
which  uses  substantial  word  widths  as  contrasted  with  bit  plane  and  cellular  array 
machines.  The  final  paragraph,  "Future  Directions",  describes  hardware  imple¬ 
mentation  efforts  to  be  pursued  in  the  next  quarter. 

4.1  SYSTEM  CONSTRAINTS 

In  the  Second  and  Third  Quarterly  Reports,  tactical  scenarios  were  developed  in 
which  the  intelligent  tracker  played  a  major  role.  It  is  not  the  purpose  of  this 
section  to  review  that  work,  but  rather  to  select  one  scenario  which  presents  a 
greater  hardware  implementation  challenge  than  some  of  the  others.  We  will  show 
that  the  scenario  chosen  can  be  realized  as  a  practical  system  within  the  time 
frame  of  VHSIC  and  should  benefit  substantial ly  from  the  VHSIC  program  developments. 

The  Intelligent  Tracker  found  application  onboard  the  helicopter,  RPV,  and 
tactical  missile  (Precision  Guided  Missi le,  PGM).  The  hardware  implementation 
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challenge,  in  terms  of  volume  and  power,  seems  to  be  the  greatest  for  the  small 
tactical  missile.  Volume  and  power  constraints  of  approximately  16  cubic  inches 
and  16  watts  have  been  stated  for  the  155  mm  shell  (3  inch  diameter)  by  ARRADCOM 
personnel.  The  helicopter,  on  the  other  hand,  may  be  assumed  to  accept  require¬ 
ments  for  less  than  500  cubic  inches  and  150  watts  of  power.  The  RPV  has  several 
designs  which  fall  in  this  range,  with  the  mini-RPV  requirements  tending  toward  100 
in^.  The  helicopter,  in  terms  of  the  number  of  pixels  processed  per  second,  will 
probably  require  a  higher  rate  than  the  small  tactical  missile  for  which  a  staring 
mosaic  of  perhaps  256  pixels  x  256  pixels  can  be  projected.  At  a  frame  rate  of  30 
frames  per  second,  the  processing  rate  for  the  helicopter  is  22.5  x  10^ 
pixels/sec.  and  1.97  x  10^  pixels/sec.  for  the  small  tactical  missile.  There  is 
roughly  an  order  of  magnitude  difference  in  required  speed.  The  real  question  is 
whether  the  small  tactical  missile  volume  and  power  density  goals  can  support  the 
memory,  control,  overhead,  and  image  processing  algorithms  necessary  to  achieve  a 
2  megapixels/sec.  rate.  In  a  larger  volume,  for  example,  additional  processors  can 
be  added  to  obtain  higher  speeds  with  a  sometimes  minimal  increase  in  overhead. 
However,  in  the  small  tactical  missile  case,  the  overhead  will  represent  a  higher 
percentage  of  the  overall  volume.  An  example  can  be  found  in  the  case  of  a  power 
supply.  A  fixed  volume  of  equipment  is  necessary  to  achieve  levels  of  power  below 
approximately  75  watts.  That  is,  a  power  supply  of  25-75  watts  will  have  the  same 
volume.  Above  75  watts,  the  volume  increases  linearly  with  power.  In  conclusion, 
the  small  tactical  missile  (and  artillery  shell)  probably  offers  the  greatest  hard¬ 
ware  implementation  challenge  because  the  ratio  of  fixed  overhead  (control,  memory, 
etc.)  processing  to  ALU  (Arithmetic  Logic  Unit)  processing  will  be  higher  than  for 
the  other  systems. 
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The  small  tactical  missile  with  image  processing  is  a  potent  weapon  and  is  the 
forerunner  of  the  smart  artillery  shell.  Weapons  systems  based  on  these  concepts 
appear  to  have  great  leverage  in  redressing  the  massive  numerical  advantage  enjoyed 
by  the  WARSAW  PACT  and  are  considered  high  priority  DoD  developments. 

The  small  tactical  missile  scenario  selected  should  be  realistic  and  practical 
within  the  1985  time  frame  which  is  the  projected  availability  date  for  VHSIC 
chips.  There  is  a  hierarchy  of  scenarios  which  vary  from  a  missile  which  tracks  a 
laser  spot  on  the  target  to  the  completely  autonomous  ("fire  and  forget"  or  "lock 
on  after  launch")  artillery  shell  with  a  range  of  20  km.  As  we  move  from  the 
laser-guided  missile  to  the  completely  autonomous  artillery  shell  the  degree  of 
complexity  of  the  sensor-image  processing  system  increases.  The  laser-guided  tac¬ 
tical  missile  is  in  the  field  today;  the  threat  to  this  system  is  the  increasing 
vulnerability  of  the  launch  helicopter  which  also  provides  the  laser  spot.  In  the 
Second  Quarterly  Report  (p.  1-13),  a  maximum  exposure  time  of  20  seconds  was 
assumed  before  survivability  dropped  drastically.  A  tactic  to  circumvent  this 
problem  is  to  have  another  concealed  helicopter  place  the  laser  beam  on  the  target 
from  a  mast  mounted  sight.  The  missile  is  launched  from  the  first  helicopter  in 
cooperation  with  the  second.  The  next  step  in  the  hierarchy  is  to  delete  the  laser 
and  use  the  same  helicopter  to  designate  targets  and  launch  the  missiles.  This  is 
done  by  providing  image  processing  for  both  the  helicopter  and  missiles  which 
registers  the  scene  and  the  target  between  the  two  before  launch.  The  missile 
sensor  resolution  is  such  that  the  target  is  seen  by  the  missile  before  launch. 

This  process  is  called  "lock  on  before  launch"  and  is  described  in  some  detail  in 
the  Second  and  Third  Quarterly  Reports.  The  remainder  of  the  steps  in  the  hier¬ 
archy  are  associated  with  the  process  of  "lock  on  after  launch,  (LOAL)"  which 
culminates  in  the  completely  autonomous  artillery  shell.  The  first  step 
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in  the  LOAL  hierarchy  finds  the  helicopter  obtaining  an  image  of  the  target  scene 
in  the  pop-up  mode.  The  helicopter,  initially  below  the  tree  tops,  pops-up  into 
view,  obtains  an  image  of  the  scene,  places  it  in  a  frame  storage  device,  and  ducks 

back  down.  The  scene  is  then  transferred  to  the  missile  and  it  is  launched  from 
the  helicopter  while  it  is  still  down  behind  the  trees.  The  launch  trajectory 
takes  the  missile  above  the  trees  and  toward  the  target  on  a  course  which  may  or 
may  not  duplicate  the  perspective  and  orientation  from  which  the  helicopter 
obtained  its  reference  image.  Complex  processing  routines  in  the  missile  register 
the  reference  image  with  the  image  now  seen  and  set  the  missile  guidance  controls 
accordingly.  Further,  the  missile  sensor  resolution  need  not  be  as  good  as  the 
helicopter  sensor  resolution  because  it  can  steer  using  back-  ground  scene  matching 
until  it  comes  within  detection  range  of  the  target. 

The  next  step  in  the  hierarchy  involves  the  20  km  smart  artillery  shell  and 
consists  of  two  phases.  The  first  phase  has  the  reference  image  provided  by  an  RPV 
or  forward  observer.  The  reference  image  is  transmitted  back  to  the  artillery 
battery  and  loaded  in  the  shell.  This  reference  image  is  matched  by  the  smart 
artillery  shell  during  flight  and  provides  controls  to  guide  the  shell  into  the 
vicinity  of  the  targets.  The  scene  matching  provides  midcourse  corrections.  The 
shell  can  be  guided  by  scene  matching  on  the  background,  and  not  the  target,  thus 
easing  the  sensor  resolution  requirements.  The  scene  matching  here  is  more  dif¬ 
ficult  than  in  the  previous  case  because  larger  differences  in  perspective,  orien¬ 
tation,  and  scale  may  exist  between  the  reference  image  and  the  instant  image.  The 
second  phase  is  the  completely  autonomous  artillery  shell  in  which  the  reference 
image  is  deleted.  Here,  the  challenge  seems  to  lie  in  resolving  the  conflicts 
among  the  sensor-imago  processing  constraints  representing  different  parts  of  the 
scenario. 
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In  conclusion,  it  seems  that  laser-guided  scenarios  will  continue  to  be  af¬ 
fected  by  survivability  considerations.  At  the  other  end  of  the  spectrum,  a  number 
of  technical  issues,  not  associated  with  VHSIC,  need  to  be  resolved  with  regard  to 
the  semi-autonomous  and  autonomous  artillery  shells.  A  compromise  seems  to  be  the 
LOAL  scenario  in  which  the  sensor  and  image  processor  are  contained  in  a  small 
tactical  missile  (6  inch  diameter)  rather  than  an  artillary  shell  with  its  atten¬ 
dant  launch  problems.  This  same  scenario  restricts  the  magnitude  of  the  scene 
matching  problem  by  confining  the  original  mismatch.  Here,  VLSI/VHSIC  technology 
could  be  truly  viewed  as  a  technical  insertion  in  the  post  1985  period.  It  appears 
that  the  image  processing  state  of  the  art  will  advance  to  accommodate  the  LOAL, 
small  tactical  missile. 

4.2  HARDWARE  CONSTRAINTS 

The  purpose  of  this  section  is  to  describe  the  VHSIC  hardware  constraints 
placed  upon  implementation  of  such  systems.  In  particular,  VLSI  hardware  in  the 
form  of  gated  arrays  of  approximately  3000  gates  per  chip  and  VHSIC  hardware  in 
the  form  of  30,000  gate  chips  are  examined.  The  VLSI  chips  are  the  "glue"  chips 
such  as  multiplexers  and  memory  address  generators  which  form  the  connections  or 
"glue"  between  the  VHSIC  chips.  Consider  the  VLSI,  VHSIC  approach  to  the  through¬ 
put  problem  of  a  digital  signal  (image)  processor.  The  following  four  paragraphs 
are  excerpts  from  Westinghouse  work  described  by  reference  2. 

The  task  of  improving  the  processing  capability  of  a  signal  processor  consists 
largely  of  finding  ways  to  increase  data  throughput  rates.  A  most  important  area 
to  be  considered  is  that  of  circuit  technology,  with  the  aim  of  minimizing  the 
delay  associated  with  each  logic  gate.  To  this  end,  logic  circuitry  with  the 
highest  speed  practical  is  highly  desirable.  Another  important  area  is  that 
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of  circuit  integration,  which  when  applied  to  a  greater  degree  can  decrease  overall 
propagation  delay.  This  is  achieved  by  reducing  the  delay  associated  with 
propagation  of  signals  from  one  integrated  circuit  to  another  by  substitution  of 
faster,  on-chip  propagation  on  a  larger  LSI  circuit.  Increasing  the  degree  of 
circuit  integration  beyond  tnat  of  generally  available  MSI/LSI  circuits,  however, 
usually  entails  a  significant  additional  expense  and  design  cycle  time  when 
producing  fully  custom  VLSI  circuits.  The  VLSI  gated  array  at  3000  gates  per  chip 
is  a  good  solution  to  the  problems  of  expense  and  design  cycle  time.  The  last 
layers  of  metal  deposition  define  the  personality  of  the  gated  array  from  a 
multiplexer,  or  a  multiplier,  to  an  eight  bit  arithmetic  logic  unit.  A  combination 
of  diffusion  and  ion  implantations  define  a  fixed  set  of  transistors  and  resistors 
arranged  in  three  types  of  cell  groupings.  Each  type  of  cell  is  customized  by 
configuring  the  two-level  metal  interconnection. 

The  three  cell  types  are;  1)  internal  cell  logic,  2)  input/output  cell,  and  3) 
input  cell.  The  internal  logic  cell  consists  of  26  transistors  and  13  resistors, 
components  sufficient  to  configure  four  3-input  OR  gates  (or  ether  configurations 
totalling  up  to  14  inputs)  and  a  number  of  other  circuits.  The  internal  schematic 
is  shown  in  Figure  4-1.  Figure  4-2  is  an  example  of  a  circuit  configured  from  the 
internal  cell  components. 

The  input/output  cell  consists  of  12  transistors  and  eight  resistors  and  can  be 
configured  as  a  3-input  gate/buffer  for  input  to  the  chip  or  output  from  the  chip. 
The  input  cell  consists  of  seven  transistors  and  three  resistors  and  is  electri¬ 
cally  the  same  as  the  input  version  of  the  input/output  cell.  A  Westinghouse 
Emitter  Coupled  Logic  (ECL)  gated  array  is  composed  of  48  internal  cells  in  a  6x8 
matrix,  24  input/output  cells  and  16  input  cells.  Pen  plots  of  the  overall  array 
and  detail  of  one  corner  are  shown  in  Figures  4-3  and  4-4.  An  example  of  a  per¬ 
sonalized  gated  array  is  shown  in  the  micrograph  of  Figure  4-5. 


Figure  4-1.  Internal  Logic  Cell  Devices 
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Figure  4-4.  Universal  Array  Corner  Detail 
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Wnile  tne  ECL  version  hes  approximately  3bl)  qates  on  a  chip,  the  CMOS-SOS  version 
has  3000  (VLSI)  gates.  This  latter  version  can  embody  the  eight  bit  ALU. 

Before  considering  the  CMOS  SOS  gated  array  in  more  detail,  consider  a  review 
of  the  procedure  necessary  to  produce  a  gate  array.  This  will  provide  a  descrip¬ 
tion  of  the  real  advantage  of  a  gate  array. 

The  design  procedure  for  a  custom  VLSI  circuit  using  this  array  starts  with  the 
generation  of  a  logic  diagram  using  conventional  CMOS  SOS  notation.  Next  a  logic 
partition  is  done  using  cell  types  in  the  array  library.  Complete  logic  simulation 
of  the  circuit  is  done  with  computer  programs  of  a  CAD  system,  which  also  includes 
programs  for  test  generation,  fntercell  connections  between  appropriately 


Figure  4-b.  Gated  Array  Micrograph 
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placed  cells  are  made  on  a  computer-aided  graphics  system  either  manually  or  by 
means  of  a  computer  program  for  automatic  cell  placement  and  interconnect  routing, 
which  is  also  part  of  the  CAD  system.  The  two  metal  masking  levels  are 
subsequently  produced  for  the  gate  array  personal-  ization.  These  patterns  are 
then  applied  to  diffused  and  metallized  wafers  to  finish  the  custom  integrated 
circuit. 

As  mentioned  previously  one  personalization  of  the  CMOS  SOS  gated  array  is  an 
eight  bit  arithmetic  logic  unit  which  requires  approximately  0.33  watts  at  a  speed 
of  10  MHz.  In  a  leadless  pack  configuration,  the  chip  is  approximately  1.0  inches 
X  1.0  inches.  The  microcode  instruction  set  includes  a  full  complement  of  logical 
operations  plus  addition,  subtraction,  and  2's  complement  multiplication.  The 
latter  means  essentially  that  16  clock  cycles  are  necessary  to  multiply  two  eight 
bit  numbers.  The  chip  is  capable  of  running  faster,  but  the  power  requirements 
increase  accordingly.  For  the  purpose  of  this  analysis,  it  is  reasonable  to  start 
with  the  10  MHz,  0.33  watts  combination.  The  10  MHz  rate  means  that  10  x  10^ 
microcode  instructions  can  be  executed  per  second.  A  rough  estimate  of  board 
spacing  indicates  that  a  minimum  of  40  such  chips  can  be  placed  on  an  8.0  inch  x  10 
inch  board.  An  added  feature  of  this  personalization  is  the  large  quantity  of 
scratch  registers  above  the  ALU.  It  is  possible  to  accommodate  32  scratch 
registers  with  approximately  100  leads  for  this  chip  which  facilitates  the  imple¬ 
mentation,  for  example,  of  a  5x5  median  filter.  The  drawback  of  the  personal¬ 
ization  is  the  word  width.  The  dynamic  range  of  future  sensors,  particularly  the 
-^onn lissance  line  scanners  like  AAO-5,  ESSWACS,  and  LOREORS,  may  require  more 
-Its,  which  poses  additional  problems. 

■  !  'luher  level  are  the  30,000  gate  and  upward  VHSIC  chips  which  may  very 
••  in-ive  gate  arrays,  not  as  stand-alone  eight  bit  ALU's,  but  as  ancil- 
■  r.  1  multiplexer.  The  VHSIC  chips  will  operate  in  the 
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25MHz  and  upward  regime  and  the  ALU's  will  probably  be  at  least  16  bits  wide. 
Further,  they  will  be  compatible  with  higher  order  languages  and  architectures  such 
as  1750,  NEBULA,  and  ADA.  The  projections  for  the  ALU  chips  are  approximately  1.0 
inches  square  and  less  than  0.6  watts  required  power,  again  in  a  CMOS  technology. 

The  VHSIC  chip,  on  the  other  hand,  will  not  have  the  versatility  of  gate  array 
personalizations  in  reduced  design  cycle  time.  However,  the  repetitive  internal 
cell  structure  will  be  easily  accommodated  on  computer  aided  design  (CAD)  equipment. 

In  summary,  the  hardware  characteristics  and  constraints  for  VLSI  and  VHSIC 
implementations  have  been  described.  They  will  be  software  programmable  and  have  a 
very  extensive  set  of  arithmetic  logic  operations.  The  ALU  implementation  has  been 
specified  because  it  is  a  convenient  start  point.  The  issue  of  architecture  is  not 
addressed  directly  in  this  section;  however,  paragraph  4.4  and  4.5  deals  with  some 
architecture  considerations. 

4,3  FEASIBILITY  -  AN  OVERVIEW 

The  purpose  of  this  paragraph  is  to  present  a  quick  look  at  the  VLSl/VHSIC 
implementation  of  image  processing  algorithms,  and  to  indicate  feasibility. 

Certain  assumptions  are  made,  based  on  prior  implementations  in  MSI  (medium  scale 
integrated)  and  CCD  (charge-coupled  device)  circuits. 

Based  on  experience  with  Auto-Q  implementation  in  MSI  at  Westinqhouse,  and 
additional  work  in  gate  arrays  and  VHSIC  with  the  Maryl and-DARPA  algorithms,  we 
might  assume  that  the  implementation  of  a  sufficient  set  of  image  processing  algo¬ 
rithms  to  perform  the  intelligent  tracking  functions  can  be  implemented  as  approxi¬ 
mately  1500  microcode  instructions.  Further,  assume  that  all  1500  instructions 
must  be  executed  per  pixel.  These  assumptions  make  no  allowance  for  bandwidth 
reduction  as  the  image  passes  through  feature  extraction,  segmentation,  and  clas¬ 
sification,  Further,  no  allowance  is  made  for  selectively  searching  the  image  nor 
reducing  the  image  frame  processed  as  close-in  tracking  is  performed. 
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The  instruction  set  with  the  above  consideration  included  may  decrease  by  as  much 
as  50  percent.  A  50  percent  increase  is  probably  not  very  likely. 

From  paragraph  4.1,  the  processing  rate  for  the  small  tactical  missile  is 
1.9x10^  pixels  per  second.  Executing  1500  micro  instructions  per  pixel  yields  a 

9 

throughput  of  2.35  x  10  micro  instructions  per  second.  For  the  VLSI  implemen- 
tation  of  eight  bit  wide  ALU's,  the  throughput  of  each  processor  is  10x10°  micro 
instructions  per  second.  Hence,  .285  x  10  or  285  ALU's  are  required  to  meet  the 
data  'ate.  This  simple  division  assumes  some  sort  of  parallel  pipeline  architec¬ 
ture,  whicn  is  a  brute  force  attack  on  the  problem.  Assume  40  ALU  chips  can  be 
accomodated  on  a  8.0  inch  x  10.0  inch  board,  and  80  ALU  chips  on  a  board  pair. 
Further,  assume  there  is  0.6  inches  between  board-pair  centers.  Then  320  ALU  chips 
can  be  placed  on  4  board  pairs  for  a  total  volume  of  144  cubic  inches.  This  is 

3 

triple  the  volume  derived  for  the  CCO  implementation.  However,  in  the  CCD  case 
the  design  was  refined  a  number  of  times.  In  this  case,  the  VLSI  implementation 
appears  feasible  with  a  number  of  very  conservative  assumptions.  The  power 
required  is  approximately  86  watts  which  is  a  factor  of  three  above  the  required 
power  as  described  in  paragraph  4.1. 

For  the  same  parallel  pipeline  architecture,  the  VHSIC  implementation  requires 
2850  X  10^/25x10^  =  114  ALU's.  Assume  that  40  such  chips  would  fit  on  a 
8.0x10.0  inch  board.  The  volume  is  less  than  57  cubic  inches  with  a  required  power 
of  less  than  70  watts. 

Consider  now  a  relaxation  of  one  of  the  conservative  assumptions.  Assume  the 
intelligent  tracker  system  concept  of  cueing  every  5th  frame  and  tracking  in  the 
intervening  four  frames.  For  ease  of  discussion,  suppose  one  portion  of  the  hard¬ 
ware  handles  cueing,  change  detection,  histogram  formation,  and  close-in  tracking. 
The  other  performs  correlation  tracking  and  aimpoint  selection.  Further, 


the  former  is  composed  of  1300  instructions  and  the  latter  is  composed  of  200 
instructions.  Roughly,  the  first  unit  requires  one  fifth  of  the  volume  and  speed 
just  calculated  and  the  latter  requires  one  seventh.  In  VLSI,  the  required  volume 
and  power  is  48  cubic  inches  and  28  watts.  In  VHSIC  implementa-  tion,  the  required 
volume  and  power  are  19  cubic  inches  and  25  watts.  This  refine-  ment  puts  the 
VLSI/VHSIC  implementation  within  range  of  the  small  tactical  missile  requirements. 
The  power  supply  must  also  have  long  shelf  life. 

In  conclusion,  a  rough  calculation  based  on  conservative  assumptions  and  no 
implementation  refinements  tends  to  indicate  that  VLSI/VHSIC  hardware  will  produce 
the  required  volumes  and  power  required  by  the  small  tactical  missile  case  and  can 
be  software  programmable.  Smart  artillery  shells  would  appear  to  require  phase  II 
VHSIC  technology  -  and,  potentially,  the  development  of  custom  designed  VHSIC 
processor  chips. 

4.4  PARALLEL  ORGANIZATION  OF  SEGMENTATION  ALGORITHMS 

To  gain  speed  in  performing  segmentation  algorithms,  the  concept  of  a  parallel 
pipeline  architecture  is  often  suggested.  However,  little  work  has  been  published 
in  implementing  such  an  organization  for  segmentation.  An  approach  to  the  problem 
is  proposed  in  ttiis  section. 

The  concept  of  a  parallel  organization  calls  for  dividing  the  image  into_  verti¬ 
cal  strips  and  assigning  a  processor  to  each  strip  as  shown  in  Figure  4-6.  Each 
processor  is  responsible  for  segmenting  its  own  vertical  strip  of  the  image.  The 
problem  arises  when  an  object  lies  across  an  artificial  boundary  created  by  the 
architecture.  This  boundary,  of  course,  does  not  exist  in  the  image.  Further, 
this  problem  can  be  overcome  in  a  parallel  organization  of  the  median  filter,  edge 
extraction,  and  non-maximum  suppression  operators.  All  that  is  necessary  to  accom¬ 
modate  a  parallel  organization  for  these  operators  is  to  overlap  the  strips 
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Figure  4-6.  Parallel  Organization 


by  one-half  the  width  of  the  moving  window.  This  allows  each  processor  to  access 
the  neighboring  pixels  in  adjacent  strips  which  it  needs  to  process  the  edge  pixels 
in  its  strip  as  the  center  of  a  moving  window.  For  the  segmentation  algorithms  the 
problem  is  more  complicated.  As  an  example  consider  the  Connected  Components 
algorithm  described  in  Appendix  A  of  the  First  Quarterly  Report. 

The  purpose  of  the  Connected  Components  algorithm  is  to  segment  the  image  data 
stream  into  smaller  domains.  Each  small  domain  includes  a  single  object  in  the 
image  plane.  This  algorithm  distinguishes  between  objects  and  isolates  regions  so 
that  classification  statistics  can  be  obtained. 

Assume  that  the  original  image  has  been  thresholded  and  the  result  is  in  binary 
form  with  gray  levels  exceeding  the  threshold  t.  shown  as  I's  in  Figure  4-7.  Two 
image  lines  are  retained  in  memory  so  that  each  pixel  can 

examine  its  neighbors  to  the  left  and  right  and  above  and  below.  No  diagonal  con¬ 
nections  are  permitted  under  this  convention,  and  an  adjacent  (horizontal  or 
vertical)  pixel  must  be  occupied  in  order  to  make  a  connection.  No  skips  or  gaps 
are  allowed  and  the  computations  start  one  pixel  in  from  the  edge.  In 
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Figure  4-8,  we  have  begun  the  process  of  labeling  the  connected  components  of 
Figure  4.7.  At  this  point  in  the  process,  there  are  four  distinct  regions.  A,  B, 

C,  and  D,  The  only  possible  connection  between  regions  B  and  C  (to  this  point)  is 
through  a  diagonal,  which  is  not  allowed.  Computations  for  the  fourth  row  are  seen 
in  Figure  4-9.  Here  there  is  an  equivalence  between  regions  B  and  C  and  an  equiva¬ 
lent  statement,  B  =  C,  is  carried  along.  At  tne  end  of  the  sixth  row,  there  is 
another  connection  between  C  and  D  (C  =  D)  and  all  the  regions  are  completed  as 
seen  in  Figure  4-10. 

If  the  above  process  is  repeated,  using  the  partitioned  image  as  shown  in 
Figure  4-6,  the  result  is  shown  in  Figure  4-11.  The  equivalence  will  not  be  dis¬ 
covered.  Figure  4-11  shows  more  sub-regions  comprising  the  objects.  Let  us  assume 
for  the  moment  that  the  equivalences  within  a  partitioned  strip  have  been  dis¬ 
covered  and  corrected.  The  result  appears  in  Figure  4-12. 
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Figure  4-8.  Computations  for  Second  Row 
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Figure  4-9.  Computations  for  the  Fourth  Row 
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Figure  4-10.  Completed  Image 


4-16  - 


Kl 

■ 

■ 

R 

m 

n 

H 

H 

■ 

R 

H 

H 

R 

H 

H 

H 

R 

H 

R 

H 

R 

B 

R 

m 

H 

H 

R 

H 

R 

R 

R 

H 

IR 

R 

R 

H 

R 

R 

R 

R 

R 

B 

IR 

m 

H 

H 

R 

H 

R 

H 

R 

H 

IR 

m 

■ 

R 

R 

R 

R 

□ 

B 

B 

IR 

m 

■ 

R 

R 

R 

R 

R 

H 

B 

Figure  4-11.  Completed  Image  Without  Equivalence  with  Partitioning 

Consider  an  approach  to  resolving  the  dilemma  posed  by  Figure  4-12  which 
involves  a  straightforward  computation  of  connected  component  and  equivalence 
statements  and  a  different  memory  design.  As  seen  in  Figure  4-7,  8,  9,  11  the 
natural  movement  of  the  connected  components  is  from  left  to  right,  across  an 
entire  line  before  the  next  line  is  analyzed.  This  is  opposite  from  partitioning 
the  image  into  vertical  strips.  However,  if  there  is  a  directed  line  connecting 
adjacent  processors  from  the  register  stack  of  one  to  the  other  such  that  the 
results  of  processor  number  1  are  passed  to  processor  number  2,  then  a  line  of  the 
image  can  be  executed  as  shown  in  Figure  4-13.  Processor  number  1  is  associated 
with  vertical  strip  number  1,  processor  number  2  is  associated  with  vertical  strip 
number  2  and  so  on.  In  practice,  processor  number  1  processes  line  number  1  of 
vertical  strip  number  1  and  hands  it  to  processor  number  2  which  processes  line 
number  1  of  vertical  strip  number  2.  In  the  meantime,  processor  number  1  is  now 
processing  line  number  2  of  vertical  strip  number  1.  In  this  way,  the  pipeline  is 
filled  such  that  there  is  no  waiting. 
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Figure  4-12.  Completed  Image  without  Equivalence  Across  Partitions 
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Figure  4-13.  Execution  for  Parallel  Organization 


In  conclusion,  a  directed  line  between  processors  allows  the  simultaneous  hori¬ 


zontal  transfer  of  data  between  processors.  This  permits  a  parallel  pipeline 
organization  to  avoid  the  problem  of  segmenting  across  artificial  boundaries 
created  by  the  parallel  organization.  This  is  a  variation  on  the  cellular  array 
architecture  for  image  processing  analyzed  by  Westinghouse  -  Maryland  and  sponsored 
by  OARPA  -  NV&EOL. 

4.5  SUPERSLICE  -  A  CLOSER  LOOK 

The  Superslice  concept  and  CCD  implementation  were  developed  by  Maryland  - 

Westinghouse  under  sponsorship  of  DARPA  -  NV&EOL.  In  the  final  report,  Ref.  5.3, 

the  threshold  selection  philosophy  was  stated  on  page  4-20  as  follows: 

"...  The  philosophy  here  is  not  to  attempt  to  find  a  single 
threshold  but  rather  to  use  a  set  of  thresholds  which  span  the 
range  of  gray  scale.  For  the  NVL  FLIR  data,  fifteen  (15)  gray 
scale  levels  represented  the  gray  scale  ranges  and  selecting  a 
threshold  every  three  gray  scales  was  deemed  to  give 
satisfactory  target  detections  by  the  University  of  Maryland." 

Although  this  report  was  written  in  March  of  1978  and  represented  the  state  of 

the  art,  cuers  are  now  being  held  responsible  for  target  detections  over  a 

broader  and  more  difficult  set  of  targets.  For  example,  partially  occluded 

targets  and  target  in  heavy  clutter  are  notable.  What  these  mean  to  Super- 

slice  from  the  Intelligent  Tracker  experience  is  that  the  threshold  interval 

of  three  gray  levels  is  too  wide  for  the  expanded  target  set.  In  some  cases 

there  are  only  one  or  two  levels  at  which  the  target  is  cleanly  segmented  and 

it  is  either  merged  or  does  not  appear  at  all  for  thresholds  on  either  side  of 

the  narrow  range  of  thresholds.  Further,  aides  such  as  kurtosis  (histogram 

peakedness)  and  percent  of  matches  above  66%  were  developed  to  indicate 

detections  where  a  edge/perimeter  point  maximum  did  not  exist. 
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In  conclusion,  the  VLSI/VHSIC  hardware  implementation  will  have  to  cope  with 
the  possibility  of  a  variable  number  of  thresholds  for  a  single  frame.  However,  it 
may  not  be  necessary  to  obtain  throughput  rates  commensurate  with  fifteen  thres¬ 
holds  on  every  frame.  For  example,  one  can  conceive  of  utilizing  fifteen  thres¬ 
holds  in  a  search  mode  over  several  frames  if  30  frames  per  second  rates  are 
achieved,  i.e.  seven  or  eight  thresholds  per  frame.  This  is  not  necessarily 
required  in  a  track  mode.  Further,  the  idea  of  average  power  and  peak  power  has 
application  here,  because  the  search  mode  may  require  higher  clock  rates  and 
increased  power  (peak  power)  for  short  periods  of  time.  On  the  other  hand,  the 

track  mode  may  be  more  appropriate  for  longer  time  periods,  requiring  lower  clock 

rates  and  lower  power.  Hence,  the  average  power  is  within  the  allowable  power 
density  (watts/in  )  for  VLSI/VHSIC  applications.  The  Intelligent  Tracker  concept 
has  the  advantage  of  low  average  power  while  the  cuer  rate  can  be  increased  as 
required . 

4.6  CONNECTED  COMPONENTS  AND  WORD  WIDTH 

The  purpose  of  this  section  is  to  broach  the  idea  of  implementing  bit  plane 
architecture  in  the  traditional  ALU.  Cellular  arrays  are  also  discussed. 

Bit  plane  architecture  is  defined  here  as  operating  on  a  word  by  operating  on 
each  bit  separately.  For  example,  in  a  traditional  ALU,  an  AND  between  two 
eight-bit  words  results  in  a  one  bit  flag  which  describes  the  result  of  ANDing  the 
two  words.  A  bit  plane  implementation  results  in  eight  flags  each  of  which  sig¬ 
nifies  the  status  of  each  of  the  eight  pairs.  If  this  sort  of  approach  could  be 

applied  to  the  Connected  Component  algorithm,  then  it  is  conceivable  that  two  16 
bit  ALU's  could  perform  the  segmentation  for  fifteen  superslice  thresholds  in 
parallel.  Recall  that  the  Connected  Components  algorithm  is  performed  in  binary 
space.  However,  it  also  requires  cojnditional  branching  depending  on  existence  of 
matching  topologically  related  components.  Currently,  the  hardware  implementation 
calls  for  sequential  execution  of  successive  Connected  Component  thresholds. 
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Cellular  arrays  require  an  array  of  processors  as  dense  as  the  image  array 
itself.  Each  programmable  processor  unit  must  be  capable  of  handling  a  word  width 
representing  the  dynamic  range  of  each  pixel.  Each  processor  communicates  to  each 
of  its  eight  neighbors  along  directed  lines.  Although  cellular  arrays  put  extra¬ 
ordinary  demands  on  chip  fabrication,  some  of  ideas  can  be  adopted  and  are  useful. 
An  example  of  this  is  the  parallel  implementation  of  segmentation  algorithms  as 
described  in  Section  4.4  of  this  report. 

In  conclusion,  the  bit  plane  approach  appears  to  be  incapable  of  handling 
simultaneous  branching  instructions  within  a  conventional  ALU.  One  concept  was 
adopted  from  cellular  arrays  and  is  described  in  Section  4.4. 

4.7  FUTURE  DIRECTIONS 

With  regard  to  hardware  implemenation,  the  intent  is  to  continue  with  the 
VLSI/VHSIC  implementation  by  removing  more  of  the  conservative  assumptions. 


-  4-21 


b.U  RtFEkENCES 


b.l  Ea9e  Point  Linking  Using  Convergent  Evioei .e 

D.L.  MUgrcim,  page  bb,  Proceeaings  DAkPA  Image 
Unaerstanoing  ^iorksnop,  November  197b 

b.2  LSI  universal  Array  Circuits  for  Higti  Speeo 
Programmable  Processors,  Natalie,  M.R.  b 
Brooks,  C.U.  1978  NAECON,  Dayton,  Ohio. 

5.3  Algorithms  ana  Hardware  Technology  for  Image 
Recognition,  hilgram,  D.L.,  Rosenfelo,  A., 
Willett,  T.J.,  &  Tisoale*.  G.T .  Final  Report 
unoer  Contract  DAAG53-76C-0l3b,  March  31,  1978 


-  b-i 


